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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 

(i) APPLICANT: Cech, Thomas R . 

Lingner, Joachim 
Nakamura, Toru 
Chapman 7 Karen B. 
Morin, Gregg B. 
Harley, Calvin B, 
Andrews, William H. 

(ii) TITLE OF INVENTION: Human Telomerase Catalytic Subunit 

(iii) NUMBER OF SEQUENCES: 727 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Townsend and Townsend and Crew LLP 

(B) STREET: Two Embarcadero Center, Eighth Floor 

(C) CITY: San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 94111-3834 

(v) COMPUTER READABLE FORM:. 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

*{C) OPERATING SYSTEM: PC -DOS /MS -DOS 
{ D ) SOFTWARE: Patentln Release #1.0, Version SI. 30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/974,549 

(B) FILING DATE: 19-NOV-1997 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/7 24,64 3 

(B) FILING DATE: 01-OCT-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/844,419 

(B) FILING DATE: 18-APR-1997 

(vii) PRIOR' APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/846,017 
(3) FILING DATE: 25-APR-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUM3ER: US 03/851,843 

(B) FILING DATE: 06-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER : US 08/354,050 
{ B ) FILING DATE: 09-MAY-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/911,312 

(B) FILING DATE: 14-AUG-1997 
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{vii) PRIOR APPLICATION DATA: ■ 

(A) APPLICATION NUMBER: US 08/912,951 
{ B) FILING DATE: 14 -AUG-1.997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/915,503 

(B) FILING DATE: 14-AUG-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/US97/17 618 

(B) FILING DATE: 01-OCT-1997 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: WO PCT/US97 /17 88 5 

(B) FILING DATE: Oi-OCT-1997 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Apple, Randolph Ted 

(B) REGISTRATION NUMBER: 36,429 

(C) REFERENCE /DOCKET NUMBER: 0 1538 9-002 6 10US 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (415) 576-0200 

(B) TELEFAX: (415) 576-0300 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4015 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAN DE.DNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : cDNA 



(ix) FEATURE: -„ 

(A) . NAME /KEY : CDS 

(B) LOCATION: 56 . . 3454 

(D) OTHER INFORMATION: /product— "hTRT" 

/note- "human telomerase reverse 
transcriptase ( hTRT ) catalytic protein 
comoonent" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:1: 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 58 

Met 
1 

CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC 10 6 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 
5 10 ■ 15 

TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 15 4 

Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 
20 25 30 
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CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 202 

Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 

35 40 " 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 250 

Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 

50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 298 

Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 

70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 34 6 

Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 

85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG . 394 

Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 

100 105 110 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC . 44 2 

Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro.hsn Thr Val Thr Asp 

115 120 " 125 

GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC 4 90 

Ala Leu Arg Gly Ser Gly Ala Trp Gly Leo Leu Leu Arg Arg Val Gly 

120 135 140 145 

GAC GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 538 

Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 

150 155 160 

GTG OCT CCC AGC TGC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 58 6 

Vsi Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 

165 ' 170 " 1-7 5 

CTC GGC GCT GCC ACT CAG GCC CGG CCC CCG CCA CAC GCT AG? GGA CCC "634 

Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 

180 - 1S5 1 SO 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 682 

Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 

195 J 200 ' ' 205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 7 30 

Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arc Arg Gly 

210 215 220 . 225 

GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 7 78 

Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arc Arg Gly 

230 235 240 

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 8 26 

Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 

245 250 255 

CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT GTG GTG 874 

His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 

260 265 " 270 
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TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 922 

Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 

275 280 285 

TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 97 0 

Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 

290 295 . 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAG ACG CCT TGT 1018 

Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 

310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAC TCC TCA GGC GAC 1066 

Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 

325 330 335 

AAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 

Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser 

340 345 350 

CTG ACT GGC GCT CGG AGG CTC GTG GAG ACC ATC TTT CTG GGT TCC AGG 1162 

Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser Arg 

355 360 ' 365 

CCC * TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 

Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 

370 375 . 380 " 385 

TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG 12 58 

Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 

390 395 400 

TAG TGC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 1306 

Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 

405 410 415 

TTG GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC T3 54- 

Als'Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly'. 

4 20 4 25 4 30 

TCT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG . 1402 

Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 

435 440 445 



CTG CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 14 50 

Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 

450 455 460 4 65 

CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 14 98 

Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 

470 4 7 5 480 

CAC AAC GAA CGC CGC' TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC CTG 154 6 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser Leu 

485 4 90 4 95 

GTG AAG CAT GCC AAG CTC TCG CTG CAG GAG" CTG ACG TGG AAG ATG AGC 1594 

Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 

500 505 510 
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GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT 164 2 

Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
515 520 525 

CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 1690 
Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe Leu 
530 535 540 545 

CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG CTC AGG TCT TTC TTT 17 38 

His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG CTC TTT TTC TAC CGG 17 8 6 

Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
565 570 575 

AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AGA CAG CAC ,TTG 1834 
Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu 
580 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 18 82 

Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His ■ 
595 '600 605 

CGG GAA GCC AGG CCC GCC CTG CTG ACG TCC AGA CTC CGC TTC ATC CCC 1930 
Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro 
610 615 620 625 

AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 197 8 

lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly 
630 635 640 

GCC AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG 202 6 

Ala. Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
645 ^ 650 - 655 

GTG AAG GCA CTG TTC AGC GTG CTC A-AC TAC GAG CGG GCG CGG CGC CCC 207 4 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
660 665 670 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 2122 
Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala 
675 680 685 

TGG CGC ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 217 0 

Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
690 695 700. 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 2218 
Leu Tyr Phe Val' Lys Val Asp Val Thr Gly Ala Tyr Asp Thr lie Pro 
710 715 720 

CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG .AAC 22 6 6 

Gin Asp Arg Leu Thr Glu Val lie Ala Ser He He Lys Pro Gin Asn 
■ 725 730 735 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 2314 
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 
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CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG ACA GAC "CTC 2 3 62 

His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu 

755 760 765 

CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG GAG ACC AGC CCG 24 10 

Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser Pro 

770 775 780 785 

CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC . TCC CTG AAT GAG GCC 24 58 

Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu Ala 
790 795 800 

AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC TTC ATG TGC CAC CAC GCC 25 0 6 

Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 

805 810 815 

-GTG CGC ATC AGG GGC AAG TCC TAC GTC CAG TGC CAG GGG ATC CCG CAG 25 5 4 

Val" Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly Tie Pro Gin 

820 825 830 

■GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC CTG TGC TAC GGC GAC ATG 2 602 

Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 

S35 840 845 

GAG AAC AAG CTG. TTT GCG GGG ATT CGG CGG GAC GGG CTG CTC CTG CGT 2 6 50 

Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu Arg 

350 855 . 860 " 865 

TTG GTG GAT GAT TTC TTG TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA 2 6 98 

Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
870 875 880 

ACC TTC CTC AGG ACC CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG 27 4 6 

Thr Phe Leu Arg Thr Leu Val Arg Gly' Val Pro Glu Tyr Gly Cys Val 

335 390 895 

GTG AAC TTG CGG AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC 2"7-94 

Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala 

900 ' 905 910 

CTG GGT GGC ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC 28 4 2 

Leu Gly Gly Thr Ala Phe Val Gin Met: Pro Ala His Gly Leu Phe Pro 

915 920 925 

TGG TGC GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC '2890 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Th'r Leu Glu Val Gin Ser Asp 

930 ^ ' ' 935 * 940 945 

TAC TCC AGC TAT GCC CGG ACC TCC ATG AG A GCC AGT CTC ACC TTC AAC 2 938 

Tyr Ser Ser Tyr Ala Arg Thr Ser He Arc Ala Ser Leu Thr Phe Asn 
950 . 953 960 

CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC 2 93 6 

Arg Gly Phe Lys Ala Gly Arg Asn Met Ar; Arq Lys Leu Phe Gly Val 

965 " 970 975 

TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC . 3034 

Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 

930 985 990 
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CTC CAG ACG GTG TGC ACC AAC ATC TAG AAG ATC CTC CTG CTG CAG GCG 3 08 2 

Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin Ala 

995 1000 1005 

TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG CAA GTT 3130 

Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 

1010 1015 1020 1025 

TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC TCT GAC ACG GCC TCC 317 8 

Trp Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser Asp Thr Ala Ser 

1030 1035 1040 

CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC GCA GGG ATG TCG CTG GGG 322 6 

Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 

1045 1050 1055 

GCC AAG GGC GCC GCC GGC CCT CTG CCC TCC GAG GCC GTG CAG- TGG CTG 3274 

Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 

1060 1065 1070 

TGC CAC CAA GCA " TTC CTG CTC AAG CTG ACT CGA CAC CGT GTC -ACC TAC 3322 

Cys His Gin Ala Phe ^Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 

1075 " 1080 1085 

GTG CCA CTC CTG GGG TCA CTC AGG ACA GCC CAG. ACG CAG CTG AGT CGG 3370 

Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arc 

1090 1095 1100 1105 

AAG CTC CCG GGG ACG ACG CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG 34 18 

Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 

1110 1115 1120 

GCA CTG CCC TCA GAC TTC AA.G ACC ATC CTG GAC TGATGGCCAC CCGCCCACAG 34 7 1 
A la Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1125 L U30 



3 CCG AG AGCAGACACC AGCAGCCCTG TCACGCCGGG CTC? ACG TCC CAGGGAGGGA J531 

:CGCCC ACACCCAGGC CCGCACCGCT GGGAGTCTGA GGCCTGAGTG AGTGTTTGGC 3 591 

CGAGGCCTGC ATGTCCGGCT GAAGGCTGAG TGTCCGGCTG AGGCCTGAGC GAGTGTCCAG 3 651 

CCAAGGGCTG AGTGTCCAGC ACACCTGCCG TCTTCACTTC CCCACAGGCT GGCGCTCGGC 3711 

TCCACCCCAG GGCCAGCTTT TCCTCACCAG GAGCCCGGCT TCCACTCCCC ACATAGGAAT 3771 

AGTCCATCCC CAGATTCGCC ATTGTTCACC CCTCGCCCTG CCCTCCTTTG CCTTCCACCC 38 31 

CCACCATCCA GGTGGAGACC CTGAGAAGGA CCCTGGGAGC TCTGGGAAT? TGG AG TG ACC 38 91 

AAAGGTGTGC CCTGTACACA GGCGAGGACC CTGCACCTGG ATGGGGGTCC CTGTGGGTCA 3951 

AATTGGGGGG AGGTGCTGTG GGAGTAAAAT ACTGAATATA TGAGTTTTTC AGTTTTGAAA 4011 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
1 ' 5 '10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 

Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 40 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 70 75 80 

Val Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
85 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

■Glu Ala Phe Thr Thr Ser Val Arc Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 125 

Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 " 135 . 140 

Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
145 * 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 170 175 

Gin Leu Gly Ala Ala' Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
18 0 185 190 

Pro Arg Arc Arg Leu Gly Cys .Glu Arg Ala Trp Asn His Ser Val Arg 

195 200 - ' 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 " 215 220" 

Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro. Arg Arg 
225 230 235 * ' 240 

Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 " 255 

Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 



Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280. 285 



Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser 
355 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 375 380 

Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
335 - 390 ■ 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 '425 4 30 

Giy^Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
435 440 445 



/ai G 



In Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
450 455 - 460 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
A 6 5 " 470 475 4 80 

Arg His Asn Giu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
435 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500_ 505 510 

Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 * 520 525' 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe 
530 535 540 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
54 5 1 550 555 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 
565 570 575 



Arg Lys Ser Val" Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His 
580 5S5 590 
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Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val "Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg 
675 . 680 685 

Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 710 715 720 

Pro Gin Asp Arc Leu Thr Glu Val lie Ala Ser lie He Lys Pro Gin 
725 730 ~ 735 

Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 
7 40 74 5 7 50 

Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 
755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 " 775 780 

Pro Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu 
755 ' " 790 795 800 

■Ala Ser Ser Glv -Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His 
805 * 810 * 815 

Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly lie Pro 
820, 825 830 

Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
835 840 845 

Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu 
S50 855 860 

Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 
865 ' 870 875 830 

Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys 
885 330 . 895 

Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu 
SOO " 905 910 
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Ala Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 935 940 

Asp Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe 
945 950 955 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
965 970 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
980 985 990 

Ser Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin 
995 . 1000 1005 

Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 1015 1020 

Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser Asp Thr Ala 
1025 \ ~ 1030 1035 1040 

Ser Leu Cys Tvr Ser lie Leu Lys- Ala Lys Asn Ala Gly Met Ser Leu 
■ " 1045 1050 1055 

Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp 
1060 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 
1075 10SO 1035 

Tyr Val Pro Leu' Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 - 1100 

Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asr. 
1 1 6 5 . ' 1110 1115 112: 

Pro Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1125 " 1130 . 



( 2 ; I N FORMAT ION FOR SEQ ID NO : 3 : 

(i) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs 

(B) TYPE: nucleic acid 

(C) S T RAN DE DN ES S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(i:<) FEATURE : 

(A) NAME /KEY : - 
' (B) LOCATION: 1, .2176 
(D)" OTHER INFORMATION: /note= "clone 712562" 



12 



(ix) FEATURE: 

(A) NAME /KEY ; CDS 

(3) LOCATION: 23. . 302 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

GGCCAAGTTC CTGCACTGGC TGATGAGTGT GTACGTCGTC GAGCTGCTCA GGTCTTTCTT 60 

TTATGTCACG GAGACCACGT TTCAAAAGAA CAGGCTCTTT TTCTACCGGA AGAGTGTCTG 120 

GAGCAAGTTG CAAAGCATTG GAATCAGACA GCACTTGAAG AGGGTGCAGC TGCGGGAGCT 180 

GTCGGAAGCA GAGGTCAGGC AGCATCGGGA AGCCAGGCCC GCCCTGCTGA CGTCCAGACT 24 0 

CCGCTTCATC CCCAAGCCTG ACGGGCTGCG GCCGATTGTG AACATGGACT ACGTCGTGGG 300 

AGCCAGAACG TTCCGCAGAG AAAAGAGGGC CGAGCGTCTC ACCTCGAGGG TGAAGGCACT 3 60 

GTTCAGCGTG CTCAACTACG AGCGGGCGCG GCGCCCCGGC CTCCTGGGCG CCTCTGTGCT 4 20 

GGGCCTGGAC GATATCCACA GGGCCTGGCG CACCTTCGTG CTGCGTGTGC GGGCCCAGGA 4 80 

CCCGCCGCCT GAGCTGTACT TTGTCAAGGT GGATGTGACG GGCGCGTACG ACACCATCCC .540 

CCAGGACAGG CTCACGGAGG TCATCGCCAG CATCATCAAA CCCCAGAACA CGTACTGCGT 600 

GCG7CGGTAT GCCGTGGTCC AGAAGGCCGC CCATGGGCAC GTCCGCAAGG CCTTCAAGAG 6 60 

CCACGTCCTA CGTCGAGTGC CAGGGGATCC CGCAGGGCTC CATCCTCTCC ACGCTGCTCT 7 20 

GCAGCCTGVG CTACGGCGAC A TG GAGA-AC A AGCTGTTTGC GGGGATTCGG CGGGACGGGC 7 80 

7GC7CCTGCG TTTGGTGGAT GATTTCTTGT TGGTGACACC TCACCTCACC CACGCGAAAA 84 0 

7777CCTCAG GACCCTGGTC CGAGGTGTCC CTGAG* ATGG CTGCGTGGTG AACTTGCGGA 900 

AGACAGTGGT. GAACTTCCCT GTAGAAGACG AGGCCCTGGG TGGCACGGCT- TTTGTTCAGA 960 

7GCCGGCCCA CGGCCTATTC CCCTGGTGGG GCCTGCTGCT GGATACCCGG ACCCTGGAGG 1020 

TGCAGAGCGA CTACTCCAGC TATGCCCGGA CCTCCATCAG AGCCAGTCTC ACCTTCAACC 1080 

GCGGCTTCAA GGCTGGGAGG AACATGCGTC GCAAAC7CTT TGGGGTCTTG CGGCTGAAGT 114 0 

GTCACAGCCT. GTTTCTGGAT TTGCAGGTGA ACAGCCTCCA GACGGTGTGC ACCAACATCT 1200 

ACAAGATCCT CCTGCTGChG GCGTACAGGT TTCACGCATG TGTGCTGCAG CTCCCATTTC 1260 

ATCAGCAAGT 7TGGAAGAAC CCCACATTTT TCCTGCGCGT CATCTCTGAC ACGGCCTCCC " 132 0 

7CTGCTACTC 'CATCCTGAAA GCCAAGAACG CAGGGA7G7C GCTGGGGGCC AAGGGCGCCG 138 0 

CCGGCCNTCT GCCCTCCGAG GCCGTGCAGT GGCTG7GCCA CCAAGCATTC CTGCTCAAGC 144 0 

7GACTCGACA CCGTG7CACC TACGTGCCAC TCC7GGGGTC -AC7CAGGACA GCCCAGACGC 1500 

AGCTGAGTCG G^GCTCCCG GGGACGACGC TGACTGCCCT GGAGGCCGCA GCCAACCCGG 15 60 

CACTGCCCTC AGACTTCAAG ACCATCCTGG ACTGA7GGCC ACCCGCCCAC AGCCAGGCCG 162 0 
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AGAGCAGACA CCAGCAGCCC TGTCACGCCG GGCTCTACGT CCCAGGGAGG GAGGGGCGGC 168 0 

CCACACCCAG GCCTGCACCG CTGGGAGTCT GAGGCCTGAG TGAGTGTTTG GCCGAGGCCT 17 4 0 

GCATGTCCGG CTGAAGGCTG AGTGTCCGGC TGAGGCCTGA GCGAGTGTCC AGCCAAGGGC 1800 

TGAGTGTCCA GCACACCTGC CGTCTTCACT TCCCCACAGG CTGGCGCTCG GCTCCACCCC 1860 

AGGGCCAGCT TTTCCTCACC AGGAGCCCGG' CTTCCACTCC CCACATAGGA ATAGTCCATC 192 0 

CCCAGATTCG CCATTGTTCA CCCCTCGCCC TGCCCTCCTT TGCCTTCCAC CCCCACCATC 198 0 

CAGGTGGAGA CCCTGAGAAG GACCCTGGGA GCTCTGGGAA TTTGGAGTGA CCAAAGGTGT 204 0 

GCCCTGTA.CA CAGGCGAGGA CCCTGCACCT GGATGGGGGT CCCTGTGGGT CAAATTGGGG 2100 

GGAGGTGCTG TGGGAGTAAA ATACTGAATA TATGAGTTTT TCAGTTTTGN AAAAAAAAAA 2160 
AAAAAAAAAA AAAAAA 217 6 

(2) INFORMATION FOR SEQ ID NO: A : 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3855 base pairs 
( 3 } TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY: - ■ 
■ (3) LOCATION: 1 . , 3355 

(D'J OTHER INFORMATION : /note-** "nucleic acid sequence v;it:h an 

open reading frame encoding a celt a- 18 2 
variant polypeptide" 

(!:•:) FEATURE : 

(A) NAME/ KEY: CDS 

(B) LOCATION: 56 . . 2479 

(D) OTHER INFORMATION: /product* "delta-132 variant 

polypeptide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 



CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC ■ 106 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arc; Ser His 

5 10 15 

TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 15-1 

Tyr Arg Glu'ValLeu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 

20 25 30 
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CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 

Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 

35 40 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 

Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 

50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 

Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 

Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 

85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 

Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 

100 105 110 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 

Ala Phe Thr Thr Se.r Val Arg Ser Tyr Leu Pro Asn Thr Val Thr Asp 

115 " ' 120 125 

GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC" 

Ala Leu Arc Gly Ser' Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 

130 " 135 - " 140 145 

GAC GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 

Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 155 160 

GTG GCT CCC AGC TGC GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 

Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr'Gln 

165 170 175 

CTC GGC GCT GCC ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC 

Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala 'Ser Gly Pro 

130 185 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG 

Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 

195 200 .205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 

Ala, Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 

210 215 220 225 

GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG. CGT GGC 

Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly 
230 235 ' 240 

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 

Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 

245 250 255 

CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT GTG GTG 

His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 

"260 265 270 
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TCA CCT GCC AG A CCC GCC GAA GAA GCC ACC TCT TTG GAG GGT GCG CTC 922 
Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 280 285 

TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC CAG CAC CAC GCG 970 
Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
290 , 295 300 305 

GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 1018 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC CTC TAG TCC TCA GGC GAC 1066 
Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 

AAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG AGG CCC AGC 1114 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu A.rg Pro Ser 
340 345 350 

CTG ACT- GGC GCT CGG AGG. CTC GTG GAG ACC ATC TTT. CTG GGT TCC AGG 1162 
Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser Arg 
355 ' 360 365 

CCC TGG ATG CCA GGG ACT CCC CGC AGG TTG CCC CGC CTG CCC CAG CGC 1210 
Pre Trp Ken Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 
370 * '375 380 385 

TAC TGG CAA ATG CGG CCC CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG 125 3 

Tyr Trp Gin Ket Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 

CAG TGC CCC TAC GGG GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT 1306 
Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 

4 0 5 4 1 0 - 4 15 

GCG GTC ACC CCA GCA GCC GGT GTC TGT GCC CGG GAG -AAG CCC CAG GGC 1-3 5 4 

Ala Val Thr. Pro Ala. Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420 " 425 430 

TCT GTG GCG GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG 14 02 

Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
.435 440 445 

CAG CTG CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG 14 5 0 

Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
4 50 455 . 4 60 "465 

CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 14 98 

Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 

CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG ■ AAG TTC ATC TCC CTG 154 6 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys ?he lie Ser Leu 
485 490 495 

GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG ATG AGC 159 4 

Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met: Ser 
500 505 510 
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GTG CGG GAC.TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT . 1642 

Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 

515 520 525 

CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 1690 

Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe Leu 

530 535 540 545 

CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG CTC AGG TCT TTC TTT 17 38 

His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 

550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG CTC TTT TTC TAC CGG 178 6 

Tyr Val. Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 

565 570 575 

AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AGA CAG CAC TTG 183 4 

Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly He Arg Gin His Leu 
580 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 18 8 2 

Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 

595 " 600 605 

CGG- GAA GCC AGG CCC GCC CTG CTG ACG TCC AGA CTC CGC TTC ATC CCC 19 30 

Arg Glu Ala Arg Pro Ala Leu Leu . Thr Ser Arg Leu Arg Phe He Pro 

610 615 ~ 620 625 

AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 1978 

lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 

630 635 640 

GCC AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG 202 6 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 

64 5 65C ~ 655 

GTG AAG GCA CTG TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC- CCC - " "207-4 ■ 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Aia Arg Arg Pro 
660 * 665 670 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 2122 

Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala 

675 680 685 

TGG CGC ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 2170 

T.rp Arg Thr Phe Val Leu Arg Val Arg Aia Gin Asp Pro Pro Pro Glu 

690 " 695 700 * 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC. 2218 

Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr lie Pro 

710 . 715 720 

CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC .AAA CCC CAG AAC 22 66 

■Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin Asn 

725 730 735 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 2314 

Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 
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CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA GTG CCA GGG 2362 
His Val Arg Lys Ala Phe Lys Ser His Val Leu Arg Pro Val Pro Gly 
755 760 765 

GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG CAG CCT GTG CTA 24 10 
Asp Pro Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu 
770 775 780 785 

CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT TCG GCG GGA CGG GCT 24 58 
Arg Arg His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala 
790 795 800 

GCT CCT GCG TTT GGT GGA TGATTTCTTG TTGGTGACAC CTCACCTCAC 2506 
Ala Pro Ala Phe Gly Gly 
805 

CCACGCGAAA ACCTTCCTCA GGACCCTGGT CCGAGGTGTC CCTGAGTATG GCTGCGTGGT 2566 

GAACTTGCGG AAGACAGTGG TGAACTTCCC TGTAGAAGAC GAGGCCCTGG GTGGCACGGC 2 62 6 

TTTTGTTCAG A7GCCGGCCC ACGGCCTATT CCCCTGGTGC GGCCTGCTGC TGGATACCCG 2 68 6 

GACCCTGGAG- GTGCAGAGCG ACTACTCCAG CTATGCCCGG ACCTCCATCA GAGCCAGTCT 2746 

CACCTTCAAC CGCGGCTTCA AGGCTGGGAG GAACATGCGT CGCAAACTCT TTGGGGTCTT 2806 

GCGGCTGAAG TGTCACAGCC TGTTTCTGGA TTTGCAGGTG AACAGCCTCC A.GACGGTGTG 28 66 

CACCAACATC TACAAGATCC TCCTGCTGCA GGCGTACAGG TTTCACGCAT GTGTGCTGCA 2 92 6 
GCTCCCATTT CATCAGCAAG TTTGGAAGAA CCCCACATTT TTCCTGCGCG TCATCTCTGA . 2 98 6 

CAGGGCCTCG CTCTGCTACT C CATC CTG AA AGCCAAGAAC GCAGGGATGT CGCTGGGGGC 304 6 

CAAGGGCGCC GCCGGCCCTC TGCCCTCCGA GGCCGTGCAG TGGCTGTGCC ACCAAGCAT7 3106 

CCTGC7CAAG CTGACTCGAC ACCGTGTCAC CTACGTGCCA CTCCTGGGGT CACTCAGGAC 3166 

AGCCCAGACG CAGCTGAGTC GGAAGCTCCC GGGGACGACG CTGACTGCCC TGGAGGCCGC 32 2 6 

AGCCAACCCG GCACTGCCCT CAGACTTCAA GACCATCCTG GACTGATGGC CACCCGCCCA 328 6 

CAGCCAGGCC GAGAGCAGAC ACCAGCAGCC CTGTCACGCC GGGCTCTACG TCCCAGGGAG 334 6 

GGAGGGGCGG CCCACACCCA GGCCCGCACC GCTGGGAGTC TGAGGCCTGA GTGAGTGTTT 340 6 

GGCCGAGGCC TGCATGTCCG GCTGAAGGGT GAGTGTCCGG CTGAGGCCTG AGCGAGTGTC 34 66 

CAGCCAAGGG CTGAGTGTCC AGCACACCTG CCGTCTTCAC TTCCCCACAG GCTGGCGCTC 3 52 6 

GGCTCCACCC CAGGGCCAGC TTTTCCTCAC CAGGAGCCCG GCTTCCACTC CCCACATAGG 3 58 6 

AATAGTCCAT CCCCAGATTC GCCATTGTTC ACCCCTCGCC CTGCCCTCCT TTGCCTTCCA 364 6 

CCCCCACCAT CCAGGTGGAG ACCCTGAGAA GGACCCTGGG AGCTCTGGGA ATTTGGAGTG 3706 

ACCAAAGGTG TGCCCTGTAC ACAGGCGAGG ACCCTGCACC TGGATGGGGG TCCCTGTGGG 37 6 6 

TCAAATTGGG GGG AGGTGCT GTGGGAGTAA AATACTGAAT ATATGAGTTT TTCAGTTTTG 332 6 

AAAAAAAAAA AAJLAAAAAAA. AAAAAAAAA 38 5 5 
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(2) INFORMATION FOR SEQ ID NO;5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
1 5 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 ■ ■ 30 

Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 " 40 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
5 0 5 5 60 

Pro Pro Ala Ala Pro Ser Phe Arg Gin *Val Ser Cys Leu Lys Glu Leu 
65- 70 75 "80 

Val Ala Arg Val Leu G-ln Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
35 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 10*5 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 " 125 



u Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arc Arg Val 
130 * 135 ' .140 * . . 



Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 

145 " * 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 

165 170 - 175 

Gin Leu Gly Ala Ala Thr Gin, Ala Arg Pro Pro Pro His Ala Ser Gly 

180 * ' 135 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 

■ 195 ' 200 205 . 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 

. 210 215 220 

Gly Gly Ser Ala Ser Arc Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 

225 230. 235 " 240 

Gly Ala Als Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 

245 " 250 255 



Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 " " 265 270 
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Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 - 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser 
355 360 _ 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg A.rg Leu Pro Arg Leu Pro Gin 
370 375 380 

Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
3S5 ~ 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys 'Pro Leu Arg 
405 410 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 ** 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arc Arg Leu 

4 3 5 4 4 0 4 4 5 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
4 50 4 55 - 460 

Val Arg Ala Cys Leu Arc Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
465 470 " 475 * 480 

Arc His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
4 85 4 90 4 95 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arc Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 520 ' 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe 
530 535 540' 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 " 550 555 ' 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arc Leu Phe Phe Tyr 
565 570 ' 575 

Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arc Gin His 
580 ' 585 590 
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Leu .Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 

595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 

610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val 

625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 

660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg 

675 680 635 

Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 

690 695 700 

Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr lie 

7C5 - 710 715 720- 

Pro Gin Asp Arg Leu Thr Glu Val lie Ala Ser lie lie Lys Pro- Gin 
725 730 * 735 

Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 

740 ^ 745 750 

Gly His Val Arg Lys Ala Phe Lys Ser His Val Leu Arg Pro Val Pro 

755 ' 760 765 

Gly Asp Pro Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val 

770 * ■ " 775 730 * 

Leu Arg Arc His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg 

785 " * 790 795 . 800 

■Ala Ala Pro Ala Phe Gly Gly 
805 



lN FORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4335 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) KOLECULE TYPE: DNA (genomic) 



(ix) FEATURE : 

(A) NAME/KEY: - 

(3) LOCATIONrM . • 4335 

(D) OTHER INFORMATION:' /note= "genomic DNA insert of pGRNl4 4" 
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(ix) FEATURE: 

(A) NAME /-KEY : intron 

(B) LOCATION: 2715.. 2818 

(D) OTHER INFORMATION: /note- "intron 1 " 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4173.. 4326 

■ (D) OTHER INFORMATION : /note- "intron 2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

CCATGGGACC CACTGCAGGG GCAGCTGGGA GGCTGCAGGC TTCAGGTCCC AGTGGGGTTG 60 

CCATCTGCCA GTAGAAACCT GATGTAGAAT CAGGGCGCGA GTGTGGACAC TGTCCTGAAT 120 

CTCAATGTCT CAGTGTGTGC TGAAACATGT AGAAATTAAA GTCCATCCCT CCTACTCTAC .180 

TGGGATTGAG CCCCTTCCCT ATCCCCCCCC AGGGGCAGAG GAGTTCCTCT CACTCCTGTG 240 

GAGGAAGGAA TGATACTTTG TTATTTTTCA CTGCTGGTAC TGAATCCACT GTTTCATTTG 300 

TTGGTTTGTT TGTTTTGTTT TGAGAGGCGG TTTCACTCTT GTTGCTCAGG CTGGAGGGAG 360 

7GCAATGGCG CGATCTTGGC TTACTGCAGC CTCTGCCTCC CAGGTTCAAG TGATTCTCCT 4 20 

GCTTCCGCCT CCCATTTGGC TGGGATTACA GGCACCCGCC ACCATGCCCA GCTAATTTTT 4 80 

7GTATTTTTA GTAGAGACGG GGGTGGGGGT GGGG7TCACC ATGTTGGCCA GGCTGGTCTC 54 0 

GAACTTCTGA. CCTCAGATGA TCCACCTGCC TCTGCCTCCT AAAGTGCTGG GATTACAGG7 600 

G7GAGCCACC ATGCCCAGCT CAGAATTTAC TCTGTTTAGA AACATCTGGG TCTGAGGTAG 660 

GAAGCTCACC CCACTCAAGT GTTGTGGTGT TTTAAJGCCAA TGATAGAAT7 T7TTTATTG7 72 0 

7G7TAGAACA CTCTTGATGT TTTACACTGT GATGACTAAG ACATCATCAG CTTTTCAAAG ' 780 

ACACACTAAC TGCACCCATA ATACTGGGGT GTCTTCTGGG TATCAGCGAT CTTCATTGAA 84 0 

TGCCGGGAGG CGTTTCCTCG CCATGCACAT GGTGTTAATT ACTCCAGCAT AATCTTC7GC 900 

TTCCAT7TCT TCTCTTCCCT CTTTTAAAAT TGTGTTTTCT ATGTTGGCTT CTCTGCAGAG 960 

AACCAGTGTA AGCTACAACT TAACTTTTGT TGGAACAAAT TTTCCAAACC GCCCCTTTGC 102 0 

CCTAGTGGCA GAGACAATTC ACAAACACAG CCCTTTAAAA AGGCTTAGGG ATCACTAAGG 108 0 

GGATTTCTAG AAGAGCGACC CGTAATCCTT AAGTATTTAC AAGACGAGGC TAACCTCCAG 1140 

CGAGCGTGAC AGCCCAGGGA GGGTGCGAGG CCTG7TCAAA TGCTAAGCTT CCA.TAAAT.-_A 1200 

AGCAAATTTC CTCCGGCAGT TTCTGGAAAG TAGGAAAGGT TAACATTTAA GGTTGCGT7T 1260 

GTTAGCATTT CAGTGTTTGC CGACCTCAGC TAACAGCATC CCTGCAAGGC CTCGGGAGAC 1320 

CCAGA_AGTTT CTCGCCCCTT AGATCCAAAC TTGAGCAACC CGGAGTCTGG ATTCCTGGGA 138 0 

AGTCCTCAGC TGTCCTGCGG TTGTGCCGGG GCCCCAGGTC TGGAGGGGAC CAGTGGCCGT 144 0 
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GTGGCTTCTA CTGCTGGGCT GGAAGTCGGG CCTCCTAGCT' CTGCAGTCCG AGGCTTGGAG .1500 

CCAGGTGCCT GGACCCCGAG GCTGCCCTCC ACCCTGTGCG GGCGGGATGT GACCAGATGT 15 60 

TGGCCTCATC TGCCAGACAG AGTGCCGGGG CCCAGGGTCA AGGCCGTTGT GGCTGGTGTG 1620 

AGGCGCCCGG TGCGCGGCCA GCAGGAGCGC CTGGCTCCAT TTCCCACCCT TTCTCGACGG 168 0 

GACCGCCCCG GTGGGTGATT AACAGATATT GGGGTGGTTT GCTCATGGTG GGGACCCCTT 1*7 4 0 

CGCCGCCTGA GAACCTGCAA AGAGAAATGA GGGGGGTGTG TCAAGGAGCC CAAGTCGCGG 18 00 

GGAAGTGTTG CAGGGAGGCA CTCCGGGAGG TCCCGCGTGC CCGTCCAGGG AGCAATGCGT 18 60 

CCTCGGGTTC GTCCCCAGCC GCGTCTACGC GCCTCCGTCC TCCCCTTCAC GTCCGGCATT 1920 

CGTGGTGCCC GGAGCCCGAC GCCCCGCGTC CGGACCTGGA GGChGCCCTG GGTCTCCGGA 1980 

TCAGGCCAGC GGCCAAAGGG TCGCCGCACG CACCTGTTCC CAGGGCCTCC ACATCATGGC 2 04 0 

CCCTCCCTCG GGTTACCCCA CAGCCTAGGC CGATTCGACC TCTCTCCGCT GGGGCCCTCG 2100 

CTGGCGTCCC TGCACCCTGG GAGCGCGAGC GGGGCGCGGG CGGGGAAGCG CGGCCCAGAC 2160 

CCCCGGGTCC GCCCGGAGCA GCTGCGCTGT CGGGGCCAGG CCGGGCTCCC AGTGGATTCG 2220 

GGGGCAACAG ACGCCCAGGA CCGCGCTTCC CACGTGGCGG AGGGACTGGG GACCCGGGCA 2 2 80 

CCGGTCCTGC CCCTTCACCT TCCAGCTCCG CCTCGTCCGC GCGGPACGCG GCGCCGTCCC 23 4 0 

GAACCCTTCC CGGGTCCCCG GCC.CAGCCCC TTCCGGGCCA TCCCAGCCCG TCCCGTTCCT 2 4 00 

777CCGCGGC CCCGCCCTCT CCTCGCGGGG CGAGTTTCAG GCAGCGCTGC GTCCTGCTGC 2 4 60 

GCACGTGGGA AGCCGTGGCC CCGGCCACCC CCGCGATGCC GCGCGCTCCC CGCTGCCGAG 2 520 

CCGTGCGCTC CCTGGTGCGC AGCCACTACC GCGAGGTGCT GCCGCTGGCC- ACGTTGGTGC '2580 

GGGGCCTGGG GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGGG 2 64 0 

CGGTGGTGGC CCAGTGCCTG GTGTGCGTGC CCTGGGACGC hCGGGCGCCC CCQGCCGCCC 2 7 00 

CC7GCTTCCG CCAGGTGGGC CTCGCCGGGG TCGGCGTCCG GCTGGGGTTG AGGGCGGCCG 2 7 60 

GGGGGAACCA GCGACATGCG GAGAGCAGCG CAGGGGACTC AGGGCGC77C CCCCGCAGGT 2 320 

GTCCTGCCTG AAGGAGCTGG TGGCCCGAGT GCTGCAGAGG GTGTGCGAGC GCGGCGCG:-A 2 83 0 

GAACGTGCTG GGCTTCGGCT TCGCGCTGCT GGACGGGGCC CGCGGGGGCC CCC.CCGAGGC 2 94 0 

CT7CACCACC AGCGTGCGCA GCTACCTGCC' CAACACGGTG ACCGACGCAC ' TGCGGGGGAG 3000 

CGC-GGCGTGG GGGCTGCTGC TGCGCCGCGT GGGCGACGAC GTGCTGGTTC ACCTGCTGGC 30 60 

AGGCTGCGCG CTCTTTGTGC TGGTGGCTCC CAGCTGCGCC TACGAGGTG7 GCGGGCCGCG 3120 

GG7G7ACCAG CTGGGGGGTG CCACTCAGGC CCGGCCCCCG CCACACGCTA GTGGACCCCG 318 0 

AAGGCGTCTG GGATGCGAAC GGGCCTGGAA CCATAGCGTC AGGGAGGCCG GGGTCCCCCT 32 4 0 

GGGCCTGCCA GCCCCGGGTG CGAGGAGGCG CGGGGGCAGT GCCAGCCGAA GTCTGCCGTT 3300 
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GCCCAAGAGG CCCAGGCGTG GCGCTGCCCC TGAGCCGGAG CGGACGCCCG TTGGGCAGGG 33 60 

GTCCTGGGCC CACCCGGGCA GGACGCGTGG ACCGAGTGAC CGTGGTTTCT GTGTGGTGTC 34 20 

ACCTGCCAGA CCCGCCGAAG . AAGCCACCTC TTTGGAGGGT GCGCTCTCTG GCACGCGCGA 34 80 

CTCCCACCCA TCCGTGGGCC GCCAGCACCA CGCGGGCCCC CCATCCACAT CGCGGCCACC 3 54 0 

ACGTCCCTGG GACACGCCTT GTCCCCCGGT GTACGCCGAG ACCAAGCACT TCCTCTACTC 3 600 

CTCAGGCGAC AAGGAGCAGC TGCGGCCCTC CTTCCTACTC AGCTCTCTGA GGCCCAGCCT 3 6 60 

GACTGGCGCT CGGAGGCTCG TGGAGACCAT CTTTCTGGGT TCCAGGCCCT GGATGCCAGG 37 20 

GACTCCCCGC AGGTTGCCCC GCCTGCCCCA GCGCTACTGG CAAATGCGGC CCCTGTTTCT 37 80 

GGAGCTGCTT GGGAACCACG CGCAGTGCCC CTACGGGGTG CTCCTCAAGA CGCACTGCCC 38 4 0 

GCTGCGkGCT GCGGTCACCC GAGGAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 3 9 00 

TGTGGCGGCC CCCGAGGAGG AGGACACAGA CCCCCGTCGC CTGGTGCAGC TGCTCCGCCA 3960 
GCACAGCAGC CCCTGGCAGG TGTACGGCTT CGTGCGGGCC TGCCTGCGCC GGCTGGTGCC " 4 020 

CCCAGGCCTC TGGGGCTCCA GGCACAACGA ACGCCGCTTC CTCAGGAACA CCAAGAAGTT 4 08 0 

CATCTCCCTG GGGAAGCATG CCAAGCTCTC GCTGCAGGAG CTGACGTGGA AGATGAGCGT 4 14 0 

GCGGGACTGC GCTTGGCTGC GCAGGAGCCC AGGTGAGGAG GTGGTGGCCG TCGAGGGCCC 4 200 

AGGCCCCAGA GCTGAATGCA GTAGGGGCTC A G AAA_A GGGG GCAGGCAGAG CCCTGGTCCT 4 2 60 

CCTGTCTCCA TCGTCACGTG GGCACACGTG GCTT7TCGCT CAGGACGTCG AGTGGACACG 4 320 

G7GATCGAGG TCGAC ' 4 335 

\2) INFORMATION FOR SEQ ID 1)0:1: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 240 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



(ix) FEATURE: 

( A ) NAME/ KEY : intron 
{3} LOCATION: 95 . . 1SS 

(D) OTHER INFORMATION: /notS= "intron 1' 



SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

GGf.CCCGGCG GCTTTCCGCG CGCTGGTGGC CCAGTGCCTG GTG7GCGTGC CCTGGGACGC 60 

ACGGCCGCCC CCCGCCGCCC CCTCCTTCCG CCAGGTGGGC CTCCCCGGGG TCGGCGTCCG 120 

GCTGGGGTTG AGGGCGGCCG GGGGGAACCA GCGACATGCG GAGAGCAG'CG CAGGCGACTC 18 0 
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AGGGCGCTTC CCCCGCAGGT GTCCTGCCTG AAGGAGCTGG TGGCCCGAGT GCTGCAGAGG 24 0 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 389 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..389 

(D) OTHER INFORMATION: /note= "expressed sequence tag (EST) 

AA281296" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT 60 

TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 12 0 

AGCAAGTTGC AAAGCA.TTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGACGTG 18 0 

TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 24 0 

CGC7TCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA ACATGGACTA CGTCGTGGGA 30 0 

GCC.-.GAACGT TCCGCAGAGA AAAGAGGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 360 

77:AGCGTG: 7CAACTACGA GCGGGCGCG - _ 38 9 

(2} INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE : 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 182 

(D) OTHER INFORMATION: /note= "182 basepair sequence deleted in 

clone 712562" 

(xi) SEQUENCE DESCRIPTION:. SEQ ID MO : 9 : 
TCTACCTTGA CAGACCTCCA GCCGTACATG CGACAGTTCG TGGCTCACCT GCAGGAGACC 60 
AGCCCGCTGA GGGATGCCGT CGTCATCGAG CAGAGCTCCT CCCTGAATGA GGCCAGCAGT 120 
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GGCCTCTTCG ACGTCTTGCT ACGCTTCATG TGCCACCACG CCGTGCGCAT CAGGGGCAAG 180 
TC 182 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION : 1 . . 25 9 

(D) OTHER INFORMATION: /note= "protein encoded by clone 712562" 



Txi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr 
1 5 10 15 

Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser He Gly He Arg Gin Kis Leu Lys Arg Val 
3 5 4 0* 4 5 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 -55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 
65 70 75 30 

Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Vai Gly Ala -Arg Thr 
85 " 90 95 

Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala 
100 105 110 

Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu 
115 J 120 ' 125 

Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala Trp Arg Thr 
130 ' 135 140 

Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe 
145 150 155 ■ " 160 

■ Val Lys Val Asp Val Thr Giy Ala Tyr Asp Thr lie Pro Gin Asp Arg 
165 170 175 



Leu Thr Glu Val He Ala Ser lie He Lys Pro Gin Asn Thr Tyr Cys 
180 185 190 
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Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly His Val Arg 
195 200 205 

Lys Ala Phe Lys Ser His Val Leu Arg Pro Val Pro Gly Asp Pro Ala 
210 215 220 

Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu Arg Arg His 
225 230 235 240 

Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala Ala Pro Ala 
245 250 255 

Phe Gly Gly 



INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : - peptide 



(ix) FEATURE : ' 

(A) NAME/ KEY : Modi f ied-si te 
(BJ LOCATION: 2 

(D) OTHER INFORMATION : /product- "OTHER" 

/note= "Xaa « Leu or lie" 



(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-si te 
(BJ LOCATION: 10 

(D) OTHER INFORMATION; /predict = "OTHER" 

/note- "Xaa --Leu or- lie" 

(ix) FEATURE : 

(A) NAME/ KEY : Modi f i ed-s i te 

(E) LOCATION: 11 

(D) OTHER INFORMATION: /product- " OTHER." 

/note- "Xaa = Leu or lie" 

(ix) FEATURE : 

(A) " NAME/KEY: Modifled-site 
(3) LOCATION: 12 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa - Gin or Arg" 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-s i te 
■(B) LOCATION: 2 8 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Phe or Tyr" 

(ix) FEATURE: 

(A) NAME /KEY: Modif ied-s it e 
( a ) LOCATION: 2 9 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Phe or Tyr" 
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(ix) FEATURE: 

(A) NAME/KEY : Modif ied-site 

(B) LOCATION : 31 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa - Lys or His" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 1 : 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp ■ 
15 10 15 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa 
.20 25 30 

Xaa Trp 



(2) INFORMATION FOR SEQ ID NO: 12; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

CA) NAME /KEY : Modi f ied-site 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER" 

/not e= "Xaa = Leu or lie" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 

(B) LOCATION: 10 

( D) OTHER INFORMATION: /product* "OTHER" 

/note- "Xaa = Leu or lie" 

(ix) FEATURE: 

(A) NAME/KEY : Modi f ied-site 

(B) LOCATION : 11 

( D ) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa « Leu or lie" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION : 12 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = Gin or Arc' 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 

(B) LOCATION: 2 9 

(D) OTHER INFORMATION: /procuct= "OTHER " 

/note* "Xaa = Phe or Tyr* 
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(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 30 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa ~ Phe or Tyr" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 32 

(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = Lys or His" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Trp Xaa Xaa Xaa -Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Trp 
1-5 10 15 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa 
20 25 30 

Xaa Xaa Trp 
35' 



INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE .CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 
(3) TYPE: amino acid 

(C) .STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION : I . . 12'9 

(D) OTHER INFORMATION: /note= "TRT motifs from human" 



(xi) SEQUENCE DESCRIPTION: SEQ iD-NO:13: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Va 1 Glu Leu Leu 
1 5 10 15 

Arg Ser Phe Phe Tyr Val' Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu 
20 . 25 30 

Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Giy He 
3 5 ' 4 0 4 5 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu 
50 55 60 

Val Arg Gin His A-rg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arc Leu 
65 " 70 75 80 



Arg Phe He Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp 
85 90 95 
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Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg 
100 105 110 

Leu Thr Ser 'Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg 
115 120 125 

Ala 



INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 
CC) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix}~ FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 233 

(D) OTHER INFORMATION: /note* M TRT motifs from 

Schizosaccharomvces oombe tezl' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

• lie Ser Glu lie Glu Trp Leu Val Leu Gly Lys Arg Ser Asn Ala Lys 
1 5 10 15 

Met Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin lie Phe Ala Glu Phe 
20 25 30 

lie Tyr Trp Leu Tyr Asn Ser Phe He lie Pro lie Leu Gin Ser Phe 
35^ 40 4 5 

Phe Tyr lie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe 
50 55 60 

Arg Lys Asp He Trp Lys Leu Leu Cys Arg Pro Phe lie Thr Ser Met 
65 " 70 75 80 

Lys Met Glu Ala Phe Glu Lys lie Asn Glu Asn Asn Val Arg Met Asp 
85 90 95 

Thr Gin Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys 
100 105 110 

Lys Asn Thr Phe Arg Leu He Thr Asn Leu Arc Lys Arg Phe. Leu lie 
115 12 0 12 5 

Lys Met Gly Ser Asn Lys Lys Met Leu Val Ssr Thr Asn Gin Thr Leu 
130 135 140 

Arg Pro Val Ala Ser He Leu Lys His Leu lie Asn Glu Glu Ser Ser 
145 . 150 155 160 



Gly He Pro Phe Asn Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys 
165 170 175 
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Lys Asp Leu Leu Lys His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val 

180 185 190 

Arg lie Asp lie Lys Ser Cys Tyr Asp Arg lie Lys Gin Asp Leu Met 

195 200 205 

Phe Arg lie Val Lys Lys Lys Leu Lys Asp Pro Glu Phe Val lie Arg 

210 215 220 



Lys Tyr Ala Thr He His Ala Thr Ser 
225 230 



INFORMATION FOR SEQ ID NO : 1 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



!ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3)" LOCATION: 1 . .233 

(D) OTHER INFORMATION: /note= " TRT motifs from Saccha rcmyces 

cere visiae . EST2 n 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Leu Lvs Aso Phe Arg Tro Leu Phe-Ile Ser Asp lie Trp Phe Thr Lys 

1 5 10" . . 15 

His Asp. Phe Glu Asn Leu Asn Gin Leu Ala lie Cys Phe lie Ser Trp 
20 -■ 25 30 

Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys 
35 J 40 45 

Thr Glu lie Ser Ser Thr Val Thr lie Val Tyr Phe Arg His Asp Thr 
50 55 60 

Trp Asn Lys Leu He Thr Pro Phe lie Val Glu Tyr Phe Lys Thr Tyr 
65* .70 75 ' ' 30 

Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser 
85 90 95 

Asn Phe Asn His Ser Lys Met Arg He lie Pro Lys Lys Ser Asn Asn 
100 105 110 

Glu Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu 
115 120 - 125 

Phe Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin 
130 135 140 
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Lys He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys 

145 - 150 155 160 

He Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin 
165 170 175 



Arg Leu Leu Lys Lys Phe Asn Asn 
180 

Lys Phe Asp Val Lys Ser Cys Tyr 

195 200 



Val Leu Pro Glu Leu Tyr Phe Met 
185 190 

Asp Ser He Pro Arg Met Glu Cys 
205 



Met Arg He Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val 

210 215 220 

Arg Ser Gin Tyr Phe Phe Asn Thr Asn 

225 . 230 



INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

CA) LENGTH: 233 amino acids 

(B) TYPE: amino acid 

(C) S T RAN DE DM ESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION : 1 . . 233 . 

{ D) OTHER INFORMATION: /note= "TRT motifs from Eupiotes 

aediculatus .ol23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:16: 

Thr Arg Glu lie Ser Trp Met Gin Val Glu Thr Ser- Ala Lys His Phe 
1 5 10 15 

Tyr Tyr Phe Asp His Glu Asn lie Tyr Val Leu Trp Lys Leu Leu Arg 
20 25 30 

■Trp lie Phe Glu Asp Leu Val. Val Ser Leu .lie Arg Cys Phe Phe Tyr 

35- ' 40 45 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
50 55 60 

Asn lie Trp Asp Val lie Met Lys Met Ser lie Ala As? Leu Lys Lys 
65 7 0 7 5 80 

Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu Glu Trp Lys Lys 
85 90 95 



Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro Lys Lys Thr 
100 * 105 110 
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Thr Phe Arg Pro He Met Thr Phe Asn Lys Lys lie Val Asn Ser Asp 
115 120 125 

Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu Asn Ser His 
130 135 140 

Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys Asp Pro Phe Gly 
145 150 155 160 

Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr Glu Glu Phe 
165 170 175 

Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe Phe Ala Thr 
180 185 190 

Met Asp He Glu Lys Cys Tyr Asp Ser Val Asn Arg Glu Lys Leu Ser 
195 200 205 

Thr Phe Leu Lys Thr "Thr Lys Leu Leu Ser Ser Asp Phe Trp He Met 
210 215 220 

Thr Ala Gin He Leu Lys Arg Lys Asn 
225 230 



) INFORMATION FOR SEQ ID NO : 1 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 
■ (C) STRANDEDNESS: 

{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oeotide 



(ix) FEATURE: " 
(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus telomerase RT sequence 

from motif T" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-sit e 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa ~ polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 7 : 

Xaa Phe Phe Tyr 
1 
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(2) INFORMATION FOR SEQ ID NO: 18: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

Cxi) MOLECULE TYPE: peptide 



(ix) FEATURE-: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 7 

(D) OTHER INFORMATION: /note^ "consensus telomerase RT sequence 

from motif 1" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-si te 

(B) LOCATION: 1 

(D) OTHER INFORMATION : /product- "OTHER" 

/note= "Xaa - hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro,' Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s it e 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa « hydrophobic amino acid, 
Ala , Leu, He, Val , Pro, Phe , Trp 
or Net" 



(:•:!)■ SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Xaa Arg Xaa lie Pro Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) - TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1.-4 

(D) OTHER INFORMATION: /no'.e* "consensus telomerase RT sequence 

from motif 2" 
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(ix) FEATURE: 

(A) NAME /KEY : Modi f ied- s i te 

(B) LOCATION: 3 

{ D) OTHER INFORMATION: /product- "OTHER" 

/rioter "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 : 



Phe Arg Xaa lie 
1 



(2) INFORMATION FOR SEQ I D NO: 20 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1..6 

(D) OTHER INFORMATION : /note- "consensus telo.nerase RT sequence 

from motif A" 



(ix) FEATURE :■ 

(A) NAME /KEY : Modi f ied-si te 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /products "OTHER " 

/note.= "Xaa « charged amino acid, Asp, 
Glu, His, Lys or Arg" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i t e 
(3) LOCATION: 6 

(D) OTHER INFORMATION: /product= "OTHER" 

/note=' "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp- 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID-MO:20: 

Pro Xaa Leu Tyr Phe Xaa 
1 5 ■ 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..6 

(D) OTHER INFORMATION: /note 58 "consensus teiomerase RT sequence 

from motif B ' " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

Gly lie Pro Gin Gly Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..4 

(D) OTHER INFORMATION: /note= "consensus teiomerase RT sequence 

from motif C" 



(xi) SEQUENCE' DESCRIPTION: SEQ ID NO:22: 
Leu Leu Arg Leu 



(2) INFORMATION FOR SEQ ID NO: 23: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1 amino acics 
(3) TYPE: amino acid 

(C) STRANDEDNESS.: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
{ B ) LOCATION : 1 . . 7 

(D) OTHER IN FORMATION: /note- "consensus teiomerase RT sequence 

from moti f C" 
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(ix) FEATURE: 

(A) NAME/KEY : Modi f ied-sit e 

(B) LOCATION: 5 

(D) OTHER INFORMATION; /product= "OTHER" 

/note= "Xaa - hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

Asp Asp Phe Leu Xaa lie Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 48 

(D) OTHER INFORMATION: /note= "motif T peptide from 

Schizosaccharomyces pombe TRT tezlp" 



(xi) SEQUENCE DESCRIPTION :- SEQ ID NO: 24: 

Trp Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 

1 5 " -10 15 

lie Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe.Arg Lys 

20 * 2 5 30 

Asp lie Trp Lys Leu Leu Cys Arg Pro Phe lie Thr Ser Met Lys Met 

3 5 .40 4 5 



(2) INFORMATION FOR SEQ ID NO : 2 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1.-54 
(D) OTHER INFORMATION 



/rioter "motif 1 and 2 peptide from 
Schizosaccharomyces pombe TRT tezlp" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Asn Asn Val Arg Met Asp Thr Gin Lys Thr Thr Leu Pro Pro Ala Val 
15 10 15 

lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu lie Thr Asn Leu 
20 25 . 30 

Arg Lys Arg Phe Leu lie Lys Met Gly Ser Asn Lys Lys Met Leu Val 
35 40 45 ■ 

Ser Thr Asn Gin Thr Leu 
50 



INFORMATION FOR SEQ ID NO: 26: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

( B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 34 . 

(D) OTHER INFORMATION: /note- "motif A peptide from 

Schizosaccharomyces pombe TRT te2lp" 



SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 : 

Phe Gly Arg Lys Lvs Tyr Phe Val Arg lie Asp lie Lys Ser Cys Tyr 
1 " 5* 10 15 

Asp Arc lie Lys Gin Asp Leu Met Phe Arg. lie Val Lys Lys Lys Leu 
20 25 30 

Lys Asp 



INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note* "motif B' peptide from 

Schizosaccharomyces pombe TRT tezlp" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Ser Gin Tyr Leu Gin Lys Val Gly lie Pro Gin Gly Ser lie Leu Ser 
1 5 10 15 

Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu lie Asp Glu Tyr Leu 
20 25 30 

Ser Phe Thr 
35 



INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE; 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . .'4 3 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

Schizosaccharomyces pombe TRT tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Leu Leu Arg Val Val Asp Asp Phe "Leu Phe lie Thr Val Asn Lys Lys 

1 5 10 15 

Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His 
2 0 2 5 3 0 

Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie 
35 40 



INFORMATION FOR SEQ ID NO : 2 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 
■(C) STRANDEDNESS; 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 11 " 

(D) OTHER INFORMATION: /no:e- "motif E peptide from 

Schizosaccharomyces pombe TRT tezlp" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val 
15 10 



(2) INFORMATION FOR SEQ ID NO: 30: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 48 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



lii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . .48 

(D) OTHER INFORMATION: /note- "motif T peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr 
15 10 15 



Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys 
20 25 30 



Ser Val Trp Ser Lys Leu Gin Ser lie Gly He Arg Gin His Leu Lys 



35 
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(2) INFORMATION FOR SEQ ID NO: 31: 



{i} SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 4 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1 . . 54 

(D) OTHER INFORMATION: /note- "motif 1 and 2 peptide from 

human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 31: 



Glu Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg 

1 5 10 15 

Leu Arg Phe lie Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met 
20 25 30 
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Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu 
35 40 45 

Arg Leu Thr Ser Arg Val 
50 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acici 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION: /note- "motif A peptide from human TRT 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr 
1 5 10 15 

Asp Thr lie Pro Gin Asp Arg Leu Thr Glu Val lie Ala Ser lie He 
20 25 " 30 

Lys Pro ' 



(2} INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS * 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION : 1 . . 35 

( D) OTHER INFORMATION: /note* "motif 3' peptide from human TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin Gly Ser He Leu Ser 
1 ■ 5 - 10 15 



Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe 
20 25 30 
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Ala Gly He 
35 



(2) INFORMATION FOR SEQ ID NO : 3 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 4 3 

(D) OTHER INFORMATION: /note= "motif C and D peptide from 

human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr 

I 5 10 15 

His Ala Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr 
20 25 30 

Gly Cys Val Val Asn Leu Arg Lys Thr Val Val 
35 40 



(2) INFORMATION FOR SEQ ID NO; 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : ' 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 11 

(D) OTHER INFORMATION: /note= "motif E peptide- from hup.an TRT 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 5 : 



His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu 
1 5 10 
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INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 48 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE; 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..4 8 

(D) OTHER INFORMATION: /note- "motif T peptide from Euplotes 

aediculatus p!23 M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 6 : 

Trp He Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 

1 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 

20 25 30 

Asn He Trp Asp Val He Met Lys Met Ser lie Ala Asp Leu Lys Lys 
35 40 45 



INFORMATION FOR SEQ ID NO: 37: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 54 

(D) OTHER INFORMATION: /note= "motif 1 and 2 peptide from 

Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 

Lys Glu Val Glu Glu Trp Lvs Lys Ser Leu Gly Phe Ala Pro Gly Lys 
1 .5 " 10 15 

"Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe 
20 25 30 

Asn Lys Lys He Val Asn Ser Asp Arg Lys Thr Thr Lys Leu -Thr Thr 
35 40 45 

Asn Thr Lys Leu Leu Asn 
50 
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(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
( E ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . 34 

(D) OTHER INFORMATION: /note* "motif A peptide from Euplotes 

aediculatus pl23" 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr 
1 5 10 15 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
20 .25 30 

Leu Leu 



(2) INFORMATION FOR SEQ ID NO : 3 9 : 

Ci) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 
(3) TYPE: amino acid' 

(C) STRANDEDNESS : 

(D) TO-POLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1. . 35 

(D) OTHER INFORMATION: /note« "motif B* peptide from Euplotes 

aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser 
1 5 10- 15 

Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser Leu 
20 25 30 

Gly Phe Leu 
35 
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{2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .43 

(D) OTHER INFORMATION: /note- "motif C and D peptide from 

Euplotes aediculatus pl23 n 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu Asn 
1 5 10 15 

Asn Ala Val Leu Phe lie Glu Lys Leu He Asn Val Ser Arg Glu Asn 
20 25 30 

Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr 
35 4 0 



(2) INFORMATION FOR SEQ ID NO : 4 1 : 

(i) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : - 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: . 

(A) NAME/ KEY : Peptide 

( B ) LOCATION : 1 . . 11 

(D) OTHER INFORMATION : /note- "motif E peptide from Euplot 

aediculatus p!23 M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 

Gin Asp Tyr Cys Asp Trp He Gly He Ser He 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..4 7 

(D) OTHER INFORMATION: /note- "motif T peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

Trp Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr 
1 5 10 15 

Cys Thr Glu lie Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 
- 20 25 30 

Thr Trp Asn Lys Leu He Thr Pro Phe He -Val Glu Tyr Phe Lys 
35 4.0 4 5 



(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . .26 

(DJ OTHER INFORMATION: /note* "motif 1 peptide from 

Saccharomyces cerevisiae EST2p* 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 : 

Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn Phe Asn His Ser Lys 
15 10 15 

Met Arg lie lie Pro Lys Lys Ser Asn Asn 
20 ' 25 



(2) INFORMATION FOR SEQ ID MO : A A : 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1..29 

. CD) OTHER INFORMATION: /note- "motif 2 peptide from 

Saccharomyces cerevisiae £ST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
5 10 15 

Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro 
20 25 

(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 34 

(D) OTHER INFORMATION : /note- "motif" A peptide from 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 4 5 : 

Val Leu Pro Glu Leu Tyr Phe Met Lys Phe Asp Vai Lys S-r Cys Tyr ' 
1 5 10 15 

Asp Ser lie Pro Arg Met Glu Cys Met Arg He Leu Lys Aso Ala Leu 
20 25 3C T 

Lys Asn - 

2) I N FORMATION FOR SEQ ID NO : 4 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

■ (ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note- "motif B' peptide from 

Saccharomyces cerevisiae EST2p" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 6 : 

Lys Cys Tyr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser 
15 10 15 

Ala Pro lie Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser 
20 25 30 

Glu Phe Lys 
35 



INFORMATION FOR SEQ ID NO : 4 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1..43 

(D) OTHER INFORMATION: /note= "motif- C and D peptide from 

Saccharomyces cerevisiae EST2p M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : A 1 : 

lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Aso Gin Gin 

1 5 10 15 

Gin Val He Asn lie Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr 
20 25 30 

Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala 
35 "40 



INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

■(A) NAME /KEY : Peptide 
(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note* "motif E peptide from 

Saccharomyces cerevisiae EST2p n 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Lys Glu Leu Glu Val Trp Lys His Ser Ser Thr 
1 5 10 



INFORMATION FOR SEQ ID NO : 4 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE : 

(A) NAME/KEY: Peptide ■- 

(B) LOCATION: 1 . . 4 

(D) OTHER INFORMATION : /note= "consensus non-telomerase RT - 

sequence from motif B ! " 

(ix) FEATURE : 

(A) NAME/KEY: Modi f ied-s i te 

(B) LOCATION: 1 

.(D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu., lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 9 : 

Xaa Pro Gin Glv 
1 



INFORMATION FOR SEQ ID NO: 50: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
■( B) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note= "consensus non-telomerase RT 

sequence from motif C" 
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(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= T 'Xaa « hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 

(B) LOCATION: 4 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa « hydrophobic amino, acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modif ied-site . 

(B) LOCATION : 5 

(D) OTHER INFORMATION : /product- "OTHER" 

/note= "Xaa « hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met " 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Asp Asp Xaa Xaa Xaa 
1 5 



INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide- 



<ix) ' FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 55 

. (D) OTHER INFORMATION: /note- "motif 1 and 2 peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 : 

Leu Ser Asn Glu Leu Gly Thr Gly Lys Phe Lys Phe Lys Pro Met *rg 

1 5 10 - 15 

He Val Asn lie Pro Lys Pro Lys Gly Gly He Arg Pro Leu Ser Val 

20 25 30 

Gly Asn Pro Arg Asp Lys He Val Gin Glu Val Met Arg Met He Leu 

35 40 45 
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Asp Thr lie Phe Asp Lys Lys 
50 55 



INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 
■(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: peptide 



{ ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1..34 

(D) OTHER INFORMATION: /note- "motif A peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded- 
mitochondrial protein" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Phe Gly Gly Ser Asn Trp Phe He Glu Val Asd Leu Lvs Lys Cys Phe 

1 5 10 * * 15 

Asp Thr He Ser His Asp Leu He lie Lys Glu Leu Lvs Arg Tyr He 

20 25 " 30 

Ser Asc 



I U FOR;-;;-.. ION FOR SEQ ID NO:53: 

"■(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 35 

(D) OTHER INFORMATION: /note- "motif B r peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoced 
mitochondrial protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Thr Tyr His Lys Pro Met Leu Gly Leu Pro Gin Gly Ser Leu He Ser 
1 5 10 15 
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Pro He Leu Cys Asn lie Val Met Thr Leu Val Asp Asn Trp Leu Glu 
20 25 3 0 



Asp Tyr He 
35 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
<B) LOCATION: 1. .17 

(D) OTHER INFORMATION: /note- "motif C peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoced 
mitochondrial protein" 



SEQUENCE DESCRIPTION: SEQ ID NO: 54: 




Asn 



INrORMATION FOR SEQ ID NO : 5 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGT-H: 25 amino acids 

(B) TYPE: amino acid" 

(C) STRANDEDNESS: 

(D) TOPOLOGY; linear 

di) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION : 1 . . 25 

(D). OTHER INFORMATION: /note« "motif D peptide from 

Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1- encoded 
mi tochondria 1 protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Lys Met lie Lys Arg.Asp Leu Asn Asn Phe Leu Asn Ser Leu GW Leu 
5 10 15' 
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Thr lie Asn Glu Glu Lys Thr Leu lie 
20 25 



(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: ■ 1 . . 11 
(D) OTHER INFORMATION: 



/note= "motif E peptide from 
Saccharomyces cerevisiae cytochrome 
oxidase group II intron 1-encoded 
mitochondrial orotein" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO : 5 6 : 



Glu Thr Pro Ala Arg Phe Leu Gly Tyr Asn lie 
15 10 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

X A) LENGTH: 26 amino acids 

(£} TYPE: amino acid 

(C) STRANDEDNESS: 

{ Of TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 26 

(D) OTHER INFORMATION: /note- "motif 1 peptide from 

Drosophila melanogaster TART non-LTR 
re t rot ransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Ser He Leu Arg He Gly Tyr Tyr Pro Asp Ala Trp Lys His Ala Gin 
1 5 10 15 

Val Lys Met He Leu Lys Pro Gly Lys Ser 
20 25 
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INFORMATION FOR SEQ ID NO: 58: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 'amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) ■ NAME/KEY: Peptide 

(B) LOCATION: 1 . . 29 

(D) OTHER INFORMATION: /note- "motif 2 peptide from 

Drosophila melanogas ter TART non-LTR 
retrotransposabie element reverse 
transcriptase " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

Tyr Arg Pro He Ser Leu Leu 'Ser Gly Leu Ser Lys Met Phe Glu Arg 

1 5 . 10 - 15 

Leu Leu Leu Lys Arg Leu Phe Arg Val Asp Leu Phe Lys 
20 25 



INFORMATION FOR SEQ ID NO : 5 9: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: oeotide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . .34 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Drosophila melanoga ste r TART non-LTR 
re t r ot ransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 5 9 ; 

Arg Lys Glu Tyr Cys Ser Ala Val Phe Leu Aso He Ser Glu Ala Phe 
1 5 10 15 

Asp Arg' Val Trp His Glu Gly Leu Leu Leu Lys Leu Ala Lys He Leu 
20 25 _ 30 

Pro Tyr 
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(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(E) LOCATION: 1 . .35 

(D) OTHER INFORMATION: /note- "motif B' peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 

(xi)" SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

Arg Ala Gly Gin lie Gly Ala Gly Val Pro Gin Gly Ser Asn Leu Gly 
1 -■ "5 io 15 

Pro lie Leu Tyr Ser He Phe Ser Ser Asp Met Pro Leu Pro His lie 
20 25 
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Tyr His Pro 
35 



INFORMATION FOR "SEQ ID NO : 6 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids * 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: .1 . . 17 

(D) OTHER INFORMATION: /note- "motif C peptide from 

Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61; 

Leu Ser Thr Tyr Ala Asp Asp Thr lie Val Leu Ser Ser Asp lie Leu 
5 10 15 

Ala 
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Information for seq id no:62: 
(i) sequence characteristics: 

(A) LENGTH: 26 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1, .26 

(D) OTHER INFORMATION: /note- "motif D peptide from 

Drosophila melanogaster TART non-LTR 
re trotransposable element reverse 
- transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

Asn GLuAsn Tyr Leu Lys fhr Phe Ser Asp Trp Ala Asp Lys Trp Gly 
1 5 10 15* 

lie Ser Vai Asn Ala Ala Lys Thr Gly His 
■ 20 .25 



INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

{ A) LENGTH : 11 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

; D) TOPOLOGY: linear 

(ii) XQLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION : .1 . 11 
(D) OTHER INFORMATION: 



/note- "motif E peptide "from 
Drosophila melanogaster TART non-LTR 
retrotransposable element reverse 
transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 



Glu Ser Lys Gin Ser Tyr Leu Gly Vai He Leu 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1..26 

(D) OTHER INFORMATION : /note- "motif 1 peptide from 

reverse transcriptase" 



SEQUENCE DESCRIPTION: SEQ ID NO: 64: 

Gly Lys He Ser Lys lie Gly Pro Glu Asn Pro Tyr Asn Thr Pro 



".e Ala lie Lys Lys Lys Asp Ser Thr 
20 25 



(xi) 

Glu 
1 

Val 



(2) INFORMATION FOR SEQ ID NO:65: 

fi) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 63 amino acids 
(3) TYPE: amino acid 
(C) S T RAN DE DN E S S : 
(0) TOPOLOGY: linear 

. di) MOLECULE TYPE: oeotide 



(ix) FEATURE: 

(A) .NAME / KEY : Peptide 
(3) LOCATION: 1 /. 63 

ID) OTHER INFORMATION: /note^ "motif 2 and A peptide from HIV- 

reverse transcriptase" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

Trp Arg Lys Leu Val Asp Phe Arg Glu Leu Asn Lys Arg Thr Gin P sp 
5 10 15 

?he Trp Glu Val Gin Leu Gly He Fro His Pro Ala Gly Leu Lys Lys 
20 25 3 0 ' 

Lys Lys Ser V ai Thr Val Leu Asp Val Gly Asp Ala Tyr Phe Val 
-5 40 - 



Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr lie Pr 
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55 - 60 



o 



57 



(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 35 

(D) OTHER INFORMATION: /note« "motif B 1 peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Gly lie Arc Tyr Gin Tyr Asn Val Leu Pro Gin Gly Trp Lys Gly Ser 
1 " 5 10 15 

Pro Ala He Phe Gin Ser Ser Met Thr Lys He Leu Glu Pro Phe Lys 
. 20 25 30 

Lys Gin Asn 
35 



!2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



7 EAT U RE : 

(A) NAME /KEY : ' Peptide 

(B) LOCATION: 1 . .'17 

'(D) OTHER "INFORMATION : /note= "motif C peptide from HIV-1 

reverse transcriptase"- 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

lie Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser Asp Leu Glu lie 
1 5 10 15 



Glv 



58 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 37 

(D) OTHER INFORMATION: /note= "motif D and E peptide from HIV-1 

reverse transcriptase" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:68: 

His Arg Thr Lys lie Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
1 5 10 15 

Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 
20 ' 25 30 

..t. 

Met Giy He Thr Leu 
35 



(2). INFORMATION FOR SEQ ID NO: 69: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(5) TYPE: amino acid 
(C-) STRANDEDNESS: 
[0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide - 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION : 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus telonerase RT finger 

sequence from motif 1" 



(xi)" SEQUENCE DESCRIPTION: SEQ ID NO;69: 

lie Pro Lys Lys 
1 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .5 

( D) OTHER INFORMATION: /note= "consensus telomerase RT palm 

primer grip sequence from motif C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

Leu Leu Leu Arg Leu 
1 5 



2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 amino' acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 4 

(D) OTHER INFORMATION: /note= "consensus telomerase RT palm 

primer grip sequence from motif C" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 i ; 

Asp Asp Phe Leu 
1 



2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 3 amino acids 
(S) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide \ 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 43 

(D) OTHER INFORMATION: /note= "telomerase specific motif T 

peptide from human TRT " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr 

15 10 15 

Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys 

20 25 30 

Ser Val Trp Ser Lys Leu Gin Ser He Gly He 
35 40 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 5 amino acids 
■(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



( ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 5 

( D ) OTHER INFORMATION: /note= "telomerase specific motif T ' 

peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Glu Ala Glu Val Arg 
1 5 



'(2) INFORMATION FOR SEQ ID NO : 7 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide ■ 



( ix) FEATURE : 

(A) NAME/ KEY : Peptide 

(B) , LOCATION : 1. . 16 

(D) OTHER INFORMATION: /note* "telomerase RT finger motif 

1 and 2 peptide from human TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 



Ser Arg Leu Arg Phe He Pro Lys Pro Asp Glv Leu Arg Pro lie Val 
1 " 5 * "10 " 15 
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(2) INFORMATION FOR SEQ ID NO : 7 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note 55 "telomerase RT finger motif A 

peptide from human TRT' f 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 

Pro Glu Leu Tyr Phe Val Lys Vai Asp Vai Thr Gly Ala Tyr Asp Thr 
1 5 10 15 

lie 



(2) INFORMATION FOR SEQ ID NO :-7 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
'(B) TYPE: amino acid 
(C) STRANDEDNESS : 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



'EAT U RE : - 

(A) NAME /KEY' : Peptide 

(B) LOCATION: 1 . . 22 

( d ) OTHER INFORMATION: /note* "telomerase R.T finger mc~if B 

peptide from human TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ _I D. NO : 76 : 

Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser lie Leu Ser Thr Leu 
1 5 10 . 15 

Leu Cys Ser Leu Cys Tyr 
20 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..13 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif C peptide from human TRT 11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr 
1 5 10 



(2) INFORMATION FOR SEQ ID NO : 7 8 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(E) LOCATION: -1. .16 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif D peptide from human TRT " 



':•:!) SEQUENCE DESCRIPTION: SEQ ID NO:78: 

Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arc Lys Thr Val Val 
1 5 * 10 15 



(2) INFORMATION FOR SEQ ID NO: 7 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids" 

(B) TYPE: amino acid 
CO STRANDEDNESS : . 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



;ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. ,11 

(D) OTHER INFORMATION: /note* "telomerase RT palm, primer grip 

motif E peptide from human TRT" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu 
1 5 10 



INFORMATION FOR SEQ ID NO : 8 0 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION.: 1. . 43 

(D) OTHER INFORMATION: /note= "telomerase specific motif T 

peptide from Schizosaccharomyces 
pombe TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID_NO:80:. 

Tro Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 

1 ^ 5 10 15 

He Thr Giu Ser Ser Asp Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys' 
20 25 30 

Aso He Tro Lys Leu Leu Cys Arg Pro Phe He 
35* * 40 



INFORMATION FOR SEQ ID NO : 8 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 5 

(D) OTHER INFORMATION: /note= "telomerase specific motif T' 

peptide from Schizosaccharomyces 
combe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Glu Asn Asn Val Arg 
1 5 
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(2) INFORMATION FOR SEQ ID NO : 8 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1. . 16 

(D) OTHER INFORMATION: /note* "telomerase RT finger motif 

1 and 2 peptide from Schi zosaccharomyce s 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

Ala Val He Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu He Thr 
1 5 " 10 15 



(2) INFORMATION FOR SEQ ID NO: 83: 

(i)' SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 
(3) LOCATION: - 1 . .17 

(D) OTHER INFORMATION: /note= " telorr.erase RT finger motif A 

peptide from Schi zosaccha romyces 
pombe TRT" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 3 : 

Arg Lys Lys Tyr Phe Val Arg He Asp He Lys Ser Cys Tyr Asp Arg 
15 10 15 

He 



(2) INFORMATION FOR SEQ ID NO : 8 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : ' Peptide 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note- "telomerase RT finger motif B r 

peptide from Schizosaccharomyces 
pombe TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 '4 : 

Tyr Leu Gin Lys Val Gly lie Pro Gin Gly Ser lie Leu Ser Ser Phe 
1 5 10 15 



Leu Cys His Phe Tyr Met 
20 



INFORMATION FOR SEQ ID NO: 85: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:-:) FEATURE: 

(A) NAME /KEY : Peptide 
( 3 ) LOCATION : 1 . . 13 

(D) OTHER INFORMATION: /note« "telomerase RT palm, primer grip 

motif C peptide from Schizosaccharomyces 
pombe TRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:85: 

Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie Thr 
1 ~ 5 10 



INFORMATION FOR SEQ- ID. NO : 3 6 : 

(i)" SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: "linear 

(ii) MOLECULE TYPE : peptide 



(ix) FEATURE: : 
(A) NAME /KEY. : Peptide 
(3) LOCATION: 1,-16 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif D peptide from Schizosaccharomyces 
pombe TRT" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 6 : 

Gly Phe Glu Lys His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 87: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION:- /note- "telomerase RT palm, primer grip. 

motif E peptide from Schizosaccharomyces 
pombe TRT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 8*7: 

Phe Phe Gly Phe Ser Vai Asn Met Arg Ser Leu 
1 5 10 



INFORMATION FOR- SEQ ID NO: 83: 

(i) "SEQUENCE CHARACTERISTICS: 

{ A) LENGTH: 4 3 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY: linear 

(xi) KOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION : 1 . 4 3 

(D) OTHER INFORMATION: /note= "telomerase specific motif T 

peptide from Euplotes aedicula'tus p!23* 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:88: 

Trp lie Phe Glu Asp Leu Val Val Ser Leu lie Arg Cys Phe Phe Tyr 
1 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 25 30 

Asn lie Trp Asp Val He Met Lys Net Ser He 
35 40 



67 



(2) INFORMATION FOR SEQ ID NO : 8 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE .TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY; Peptide 

(B) LOCATION: 1. . 5 

(D) OTHER INFORMATION: /note- "telomerase specific motif T ' 

peptide from Euplotes aedicuiatus p!23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:89: 

Glu Lys Glu Val Glu 
1 5 



(2)- INFORMATION FOR SEQ ID NO: 90: 

(i) SEQUENCE CHARACTERISTICS": 

{A} LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif 

1 and 2 peptide from Euplotes 
aedicuiatus pl23" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90: 

Gly Lys Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO : 9 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 
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(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . 17 

(D) OTHER INFORMATION : /note= "telomerase RT finger motif A 

peptide from Euplotes aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 91: 

Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser 
1 5 10 15 

val 



(2} INFORMATION FOR SEQ ID NO: 92: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
{ B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note= "telomerase RT finger motif B' 

peptide from Euplotes aediculatus p!23 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92: 

Tyr Lys Gin Thr Lys* Gly He Pro Gin Gly Leu Cys Val Ser Ser He 
1 5 10 15 

Leu Ser Ser Phe Tyr Tyr 
20 



(2) INFORMATION FOR SEQ IP NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1. . 13 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif C peptide from . Euplotes 
aediculatus pl23" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93: 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie Thr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



fix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif D peptide from Euplotes 
aediculatus pl23" 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
{ B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note- "telomerase RT palm, primer grip 

motif E peptide from Euplotes 
aediculatus pl23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 



Trp lie Gly lie Ser lie Asp Met Lys Thr Leu 
1 5 . 10 
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(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY ; Peptide 
{ B) LOCATION: 1. .42 

(D) OTHER INFORMATION: /note=* "telomerase specific motif T 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

Trp Leu Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr 

1 5 " 10 15 

Cys Thr Glu lie Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp 

20 25 30 

Thr Trp Asn Lys Leu He Thr Pro Phe He 
35 40 



(2; INFORMATION FOR SEQ I D. NO : 97 : 

(i) SEQUENCE CHARACTERISTICS: 
{.-.) LENGTH: 5 amino acids 
( 5 ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) ■ TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide- 
(5) LOCATION : -1 . • 5 

(D) OTHER INFORMATION: /note- "telomerase specific motif T ' 

peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97: 

Glu Asr. Asn Val Cys 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .11 

(D) OTHER INFORMATION; /note* "telomerase RT finger motif 1 

peptide from Saccharomyces 
cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98: 

Ser Lys Met Arg lie lie Pro Lys Lys Ser Asn 
1 5 10 



(2) INFORMATION FOR SEQ ID. NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

{-ii) MOLECULE TYPE: peptide ' 



(ix) FEATURE: . " 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1..5 

(D) OTHER INFORMATION: /note* "telomerase RT finger motif 2 

peptide from Seccha romyces 
cerevisiae EST 2 M 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

Phe Arg He He Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO.: LOO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . 17 

(D) OTHER INFORMATION : /note- "telomerase RT finger motif A 

peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
1 5 10 15 

lie 



(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino'acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



;ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1. .22 
■(D) OTHER INFORMATION: /note= "telomerase RT finger motif B 1 

peptide from Saccharomyces 
cerevisiae EST2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID~NO:101: 

Tyr lie Arg Glu Aso Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro 
l - - 5 10 15 

He Val Asp Leu Val Tyr 
20 



(2) INFORMATION FOR SEQ "I D NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide. 

(B) LOCATION: 1. .13 

(D) OTHER INFORMATION: /note= "telomerase RT palm, primer grip 

motif C peptide from Saccharomyces 
cerevisiae EST2 " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 102 : . 

Leu He Leu Lys leu Ala Asp Asp Phe Leu He lie Ser 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

■ (A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME /KEY : Peptide 

!d! otSr^SfORMATION: /note- "telomerase RT palm, pr.mer grip 
{U) UintL ^ motif D peptide from Saccha romyces 

cerevisiae EST2" 

"(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys lie Leu Ala 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

Id! OTHER^NFORMATION : /note- "telo.erase RT pal,, primer grip 

(D) OTHER iNturt mQtif £ peptide from saccharomyces 

cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 : 

Tro Lys His Ser S 
1 -5 



Trp Lys His Ser Ser Thr Met Asn Asn Phe His 
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(2) INFORMATION FOR SEQ ID NO: 105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : prot einjDind 

(B) LOCATION: 1 . . 10 

(D) OTHER INFORMATION: /note= "NFkappaB CS1 binding site motif" 

(xi) -SEQUENCE DESCRIPTION: SEQ ID NO:105: 
GGGRHTYYHC 10 



(2) INFORMATION FOR SEQ ID NO: 106: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11. base pairs 

(B) -TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : pro te in_bind 

(B) LOCATION : 1 . . 11 

( D } OTHER INFORMATION: /note- "NFkappaB MHC 1.2 binding 

site notif" 



(xi) SEQUENCE .DESCRIPTION: SEQ ID NO: 106: 
TGGGCTTCCC C ' 11 



(2) INFORMATION FOR SEQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS :. single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE : 

(A) NAME/ KEY : protein__bind 
(3) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note* "NFkappaB CS2 binding site motif" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 
RGGGRMTYYC C 1 

(2) INFORMATION FOR SEQ ID NO: 108: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: . 

(A) NAME/KEY: protein_bind 

(B) LOCATION: 1. . 18 

(D) OTHER INFORMATION: /note* "topoisomerase II cleavage 

site motif" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108: 
RNYNNCNNGY NGKTNYNY 1 



(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3279 base pairs 
(3) TYPE: nucleic acic 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(8) LOCATION: 101 . .3196 

(D) OTHER INFORMATION: /note* "Euplotes aediculatus 123 kDa 

telorr.erase protein subunit (TRT)" 
/codon= (seq: "tga", aa: Cys) 
/product^ "Euplotes aediculatus 123 kDa 
telomerase protein -subunit (TRT) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109: 
AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT AACCTCAGTA € 



TTAATAAGCT CAGATTTTAA AT ATT AAT T A CAAAACCTAA ATG GAG GTT GAT GTT 

Met Glu Val Asp Val 
1 5 



GAT AAT CAA GCT GAT AAT CAT GGC ATT CAC TCA GCT CTT AAG ACT TGT 
Asp Asn Gin Ala Asp Asn His Gly He His Ser Ala Leu Lys Thr Cys 
10 . 15 20 
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GAA GAA ATT AAA GAA GCT AAA ACG TTG TAG TCT TGG ATC CAG AAA GTT 211 

Glu Glu lie Lys Glu Ala Lys Thr Leu Tyr Ser Trp lie Gin Lys Val 

25 30 35 

ATT AG A TGA AGA AAT CAA TCT CAA AGT CAT TAT AAA GAT TTA GAA GAT 259 

lie Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys A.sp Leu Glu Asp 

40 45 50 

ATT .AAA ATA TTT GCG CAG ACA AAT ATT GTT GCT ACT CCA CGA GAC TAT 307 

lie Lys lie Phe Ala Gin Thr Asn lie Val Ala Thr Pro Arg Asp Tyr 

55 60 65 

AAT GAA GAA GAT TTT AAA GTT ATT GCA AGA AAA GAA GTA TTT TCA ACT 355 

Asn Glu Glu Asp Phe Lys Val lie Ala Arg Lys Glu Val Phe Ser Thr 

70 75 80 85 

GGA CTA ATG ATC GAA CTT ATT GAC AAA TGC TTA GTT GAA CTT CTT TCA ■ 4 03 

Gly Leu Met lie Glu Leu lie Asp Lys Cys Leu Val Glu Leu Leu Ser 

90 95 100 

TCA AGC GAT GTT TCA GAT AGA CAA AAA CTT CAA " TGA TTT GGA TTT CAA 451 

Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys Phe Gly Phe Gin 

105 110 115 

CTT AAG GGA AAT CAA TTA GCA AAG ACC CAT TTA TTA ACA GCT CTT TCA 4 99 

Leu Lys Gly Asn Gln_ Leu Ala Lys Thr His Leu Leu Thr Ala Leu Ser 

- 120 125 130 

ACT CAA .AAG CAG TAT TTC TTT CAA GAC GAA TGG AAC CAA GTT AGA GCA 54 7 

Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn Gin Val Arg Ala 

135 140 145 

ATG ATT GGA AAT GAG CTC TTC CGA CAT CTC TAC ACT AAA TAT" TTA ATA 595 

Met lie Gly Asn Glu Leu Phe Arg His -Leu Tyr Thr Lys Tyr Leu lie 

150 * 155 _ 160 165 

TTC CAG CGA ACT TCT GAA GGA ACT CTT GTT CAA TTT TGC GGG AAT AAC """64 3 

?he Gin Arg Thr Ser Glu Gly Thr Leu Vai Gin Phe Cys Gly Asn Asn 

170 175 180 

GTT TTT GAT CAT TTG AAA GTC AAC GAT AAG TTT GAC AAA AAG CAA AAA 691 

Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp Lys Lys Gin Lys 

' 135 . 190 • 195 

GGT GGA GCA .GCA GAC ATG AAT GAA CCT CGA TGT TGA TCA ACC TGC AAA 739 

Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys Ser Thr Cys. Lys 

200 205 ■ 210 

TAC AAT GTC AAG AAT GAG AAA GAT CAC TTT CTC AAC AAC ATC AAC GTG 7 87 

Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn Asn lie Asn Val 

215 ' ■ ' 220 225 

CCG AAT TGG AAT AAT ATG AAA TCA AGA ACC AGA ATA TTT TAT TGC ACT 8 35 

Pro Asn Trp Asn Asn- Met Lys Ser Arg Thr Arg lie Phe Tyr. Cys Thr 

230 * 235 240 245 

CAT TTT AAT AGA AAT AAC CAA TTC TTC AAA AAG CAT GAG TTT GTG AGT 8 83 

His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His Glu Phe Val- Ser 

250 255 260 
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AAC AAA AAC AAT ATT TCA GCG ATG GAC AGA GCT CAG ACG ATA TTC ACG 931 

Asia Lys Asa Asn He Ser Ala Met Asp Arg Ala Gin Thr He Phe Thr 
265 270 275 

AAT ATA TTC AGA TTT AAT AGA ATT AGA AAG AAG CTA AAA GAT AAG GTT 97 9 

Asa He Phe Arg Phe Asn Arg He Arg Lys Lys Leu Lys Asp Lys Val 
280 ■ 285 290 

ATC GAA AAA ATT GCC TAC ATG CTT GAG AAA GTG AAA GAT TTT AAC TTC 10 2 7 

He Glu Lys He Ala Tyr Met Leu Glu Lys Val Lys Asp Phe Asn Phe 

295 300 305 

AAC TAC TAT TTA AC A AAA TCT TGT CCT CTT CCA GAA AAT TGG CGG GAA 10 7 5 

Asn tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu Asn Trp Arg Glu 

310 315 320 . 325 

CGG AAA CAA AAA ATC GAA AAC TTG ATA AAT AAA ACT AGA GAA GAA AAG 1123 

Arg Lys Gin Lys lie Glu Asn Leu He Asn Lys Thr Arg Glu Glu Lys 
330 335 340 

TCG AAG TAC TAT GAA GAG CTG TTT AGC TAC AC A ACT GAT AAT AAA TGC 1171 

Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr Asp Asn Lys Cys 
345 350 355 

GTC AC A CAA TTT ATT AAT GAA TTT TTC TAC AAT ATA CTC CCC AAA GAC 1219 

val Thr Gin Phe lie Asn Glu Phe Phe Tyr Asn He Leu Pro Lys Asp 
360 365 370 

TTT TTG ACT GGA AGA AAC CGT AAG AAT TTT CAA AAG AAA GTT AAG AAA. 12 67 

Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys Lys Val Lys Lys 

373 * 330 335 

TAT GTG GAA CTA AAC AAG CAT GAA CTC ATT CAC AAA AAC TTA TTG CTT 1315 

Tyr Val Glu Leu Asn Lys .His Glu Leu He His Lys Asa Leu Leu Leu 

350 ~ 395 400 405 

GAG AAG ATC AAT AC A AGA GAA ATA TCA TGG ATG CAG GTT GAG ACC TCT 13 63 

Glu Lys lie Asn Thr Arg Glu lie Ser Trp Met Gin Val Glu Thr Ser 
4 10 ' 4 15 4 20 

GCA AAG CAT TTT TAT TAT TTT GAT CAC GAA AAC ATC TAC GTC TTA TGG 14 11 

Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn He Tyr Val Leu Trp 
425 430 435 

AAA TTG CTC CGA TGG ATA TTC GAG GAT CTC GTC GTC TCG CTG ATT AGA 14 59 

Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val Val Ser Leu lie Arg 
4 4 0. 4 4 5 450 

TGA TTT TTC TAT GTC ACC GAG CAA CAG AAA AGT TAC TCC AAA ACC TAT 1507 

Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr 

4 55 460 465 

TAC TAC AGA AAG AAT ATT TGG GAC GTC . ATT ATG AAA ATG TCA ATC GCA 15 55 

Tyr Tyr Arg Lys Asn He Trp Asp Val lie Met Lys Met Ser He Ala 

4 70 4 75 ' 4 80 4 85 

GAC TTA AAG AAG GAA ACG CTT GCT GAG GTC CAA GAA AAA GAG GTT GAA 160 3 

Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu Val Glu 
4 90 495 500 
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GAA TGG AAA AAG TCG CTT GGA TTT GCA CCT GGA AAA CTC AGA CTA ATA 
Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie 
505 510 515 
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CCG AAG AAA ACT ACT TTG CGT CCA ATT ATG ACT TTC AAT AAG AAG ATT 
Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys lie 
520 525 530 
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GTA AAT TCA GAC CGG AAG ACT ACA AAA TTA ACT ACA AAT ACG AAG TTA 
Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu 
535 540 545 



174 7 



TTG AAC TCT CAC TTA ATG CTT AAG ACA TTG AAG AAT AGA ATG TTT AAA 
Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met Phe Lys 
550 555 560 565 
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GAT CCT TTT GGA TTC GCT GTT TTT AAC TAT GAT GAT GTA ATG AAA AAG 
Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys 
570 575 580 
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TAT GAG GAG TTT GTT TGC AAA TGG AAG CAA GTT GGA CAA CCA AAA CTC 
Tyr Glu Glu Phe Val Cys Lys- Trp Lys Gin" Val Gly Gin Pro Lys Leu 
585 * ' 590 ' 595 



1891 



TTC TTT GCA ACT ATG GAT ATC GAA AAG TGA TAT GAT AGT GTA AAC AGA 
Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser Val Asn Arg 
600 605 610 



1939 



GAA AAA CTA TCA ACA TTC CTA AAA ACT ACT AAA TTA CTT TCT TCA GAT 
Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr, Lys Leu Leu Ser Ser Asp 
615 620 ■ 625 



1987 



? he 
6 30 



TGG ATT ATG ACT GCA CAA ATT CTA AAG AGA AAG AAT AAC ATA GTT 
Trp He Met: Thr Ala Gin He Leu Lys Arg Lys Asn Asn He Val 
63 5 64 0 64 5 
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ATC GAT TCG AAA AAC TTT AGA AAG AAA GAA. ATG AAA GAT TAT TTT AGA 
lie Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys Asp Tyr Phe Arg 
650 - 655 660 
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CAG AAA TTC CAG AAG ATT GCA CTT GAA GGA GGA CAA. TAT CCA ACC TTA 
Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin Tyr Pro Thr Leu 
665 ' 670 675 
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TTC AGT GTT CTT GAA. AAT GAA CAA AAT GAC TTA AAT GCA AAG AAA "ACA 
Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn Ala Lys Lys Thr 
680 685 690 
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TTA ATT GTT GAA GCA AAG CAA AGA AAT TAT TTT AAG AAA GAT AAC TTA 222 7 

Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys Lys Asp Asn Leu 

695 700 705 

CTT CAA CCA GTC ATT AAT ATT TGC CAA TAT AAT TAC ATT AAC TTT - AAT 2275 

Leu Gin Pro Val He Asn He Cys Gin Tyr Asn Tyr He Asn Phe Asn 

710 715 * 720 725 



GGG 
Glv 



AAG TTT TAT AAA CAA ACA AAA GGA ATT CCT CAA GGT CTT TGA GTT 
Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val 



730 



735 



740 



2323 
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TCA TCA ATT TTG TCA TCA TTT TAT TAT GCA ACA TTA GAG GAA AGC TCC 2 371 

Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu Ser Ser 

745 750 755 

TTA GGA TTC CTT AG A GAT GAA TCA ATG AAC CCT GAA AAT CCA AAT GTT 24 19 

Leu Gly Phe Leu Arg Asp Glu Ser Met Asa Pro Glu Asn Pro Asa Val 

760 765 770 

AAT CTT CTA ATG AG A CTT ACA GAT GAC TAT CTT TTG ATT ACA ACT CAA 24 67 

Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin 

775 780 785 

GAG AAT AAT GCA GTA TTG TTT ATT GAG AAA CTT ATA AAC GTA AGT CGT 2515 

Glu Asn Asn Ala Val Leu Phe. He Glu Lys Leu He Asn Val Ser Arg 

790 " 795 800 . 805 

GAA AAT GGA TTT AAA TTC AAT ATG AAG AAA CTA CAG ACT AGT TTT CCA 25 63 

Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser Phe Pro 

810 . 815 820 

TTA AGT CCA AGC AAA TTT GCA AAA TAC GGA ATG GAT AGT GTT GAG GAG 2 611 

Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met Asp Ser Val Glu Glu 

825 ' 830 835 

CAA AAT ATT GTT CAA GAT TAG TGC GAT TGG ATT GGC ATC TCA ATT GAT 2 65 9 

Gin Asn He Val Gin Asp Tyr Cys Asp Trp lie Gly -He Ser He Asp 

840 " 845 850- 

ATG AAA ACT CTT GCT TTA ATG CCA AAT ATT AAC TTG AG A ATA GAA GGA 2 70 7 

Met Lys Thr Leu Ala Leu Met Pro Asn He Asn Leu Arg He Glu Gly 

855 860 865 

ATT CTG TGT ACA CTC AAT CTA AAC ATG CAA ACA AAG AAA GCA TCA ATG 2 75 5 

He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys Lys Ala Ser Met . 

370 875 380 335 

TGG CTC AAG AAG AAA CTA AAG TCG TTT TTA ATG AAT AAC ATT ACC CAT 2 80 3 

Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn Asn lie Thr His 

8 90 3 9-5 900 

TAT TTT AG A. AAG ACG ATT ACA ACC GAA GAC TTT GCG AAT AAA ACT CTC 28 51 

Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe Ala Asn Lys Thr Leu 

905 910 * 915 

AAC AAG TTA TTT ATA TCA GGC GGT TAC AAA TAC ATG CAA TGA GCC AAA 28 9 9 

Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr Met Gin Cys Ala Lys 

920 925 930 

GAA TAC AAG GAC CAC TTT AAG AAG AAC TTA GCT ATG AGC AGT ATG ATC 2 94 7 

Glu Tyr Lys Asp His Phe Lys Lys Asa Leu Ala Met Ser Ser Met He 

935 " 940 945 

GAC TTA GAG GTA TCT AAA ATT ATA TAC TCT GTA ACC AGA GCA TTC TTT 2 99 5 

Asp Leu Glu Val Ser Lys lie lie Tyr Ser Val Thr Arg Ala Phe Phe 

950 955 960 965 

AAA TAC CTT GTG TGC AAT ATT AAG GAT ACA ATT TTT GGA GAG GAG CAT 30 4 3 

Lys Tyr Leu Val Cys Asn lie Lys Asp Thr He Phe Gly Glu Glu His 

970 9^5 980 
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TAT CCA GAC TTT TTC CTT AGC AC A CTG AAG CAC TTT ATT GAA ATA TTC 3091 

Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe lie Glu lie Phe 
935 990 995 

AGC ACA AAA AAG TAC ATT TTC AAC AGA GTT TGC ATG ATC CTC AAG GCA 313 9 

Sar Thr Lys Lys Tyr lie Phe Asa Arg Val Cys Met lie Leu Lys Ala 
1000 1005 1010 

AAA GAA GCA AAG CTA AAA AGT GAC CAA TGT CAA TCT CTA ATT CAA TAT 318 7 

Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin Ser Leu He Gin Tyr 
1015 1020 1025 

GAT GCA TAGTCGACTA TTCTAACTTA TTTTGGAAAG TTAATTTTCA ATTTTTGTCT 324 3 

Asp Ala 

1030 

TATATACTGG GGTTTTGGGG TTTTGGGGTT TTGGGG 327 9 



(2) INFORMATION EOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 amino acids 

(B) TYPE: amino acid 
(O) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Xe: Glu Val Aso Val Asp Asn Gin Ala Aso Asn His Gly lie His Ser 

1 "5 10 15 

Ala Leu Lys Thr Cys Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser 

20 25 30 

7rp lie Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr 

35 40 45 

Lys Asp Leu Glu Asp He Lys He Phe Ala Gin Thr Asn He Val Ala 

50 5 5 60 

Thr Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys 

65 ' 70 " 75 80 

Glu Val Phe Ser Thr Gly Leu Met lie Glu Leu He Asp Lys Cys Leu 

85 90 95 

Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 

100 105 * 110 

Cys Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu 

115 120 125 

Leu Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp 

130 135 " 140 

Asn Gin Val Arg Ala Met lie Gly Asn Glu Leu Phe Arg His Leu Tyr 

145 150 155 160 
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Thr Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin 
165 170 175 

Phe Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Aso Lys Phe 
180 - 185 19 o 

Asp Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys 
195 200 205 

Cys Ser Thr Cys Lys Tyr Asn Val . Lys Asn Glu Lys Asp His Phe Leu 
LU 215 220 

Asn Asn lie Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg 

230 235 2 40 

lie Phe Tyr Cys Thr His Ph^ Asn Arg Asn Asn Gin Phe" Phe Lys Lys 
245 250 255 

His Glu Phe Val Ser Asn Lys Asn Asn He Ser Ala Met Asd Arq Ala 
260 265 270 

Gin Thr lie Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys 
273 280 285 ■ 

Leu Lys Asp Lys Val He Glu Lys lie Ala Tyr Met Leu Glu Lys Val 
290 295 . 30Q 

Lvs Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro 



310 



315 



320 



Giu Asn Trp Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys 
325 330 335 

Thr Arg Giu Giu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe S-- Ty- Th- 
. 340 " 345 350 

Thr Asp Asn Lys Cys Val Thr Gin Phe lie Asn Glu Phe Ph = Tyr Asn 
355 350 365 

He Leu Pro Lys Aso Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin. 
370 375 380 

Lys Lys Val Lys Lys Tyr Vai Glu Leu Asn Lys His Giu Leu He His 

390 ■ 395 4 00 

Lys Asn Leu Leu Leu Glu Lys He Asn Thr Arg Glu lie Ser Trp M»t 
405 410 " 

Gin Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Aso His Glu Asn 
420 425 ■ 430 

He Tyr Val Leu Trp Lys Leu Leu Arg Trp He Phe Glu Aso Leu Val 
435 440 445 

: -'al Sec Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys S°r 
450 455 4S0 



Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn He Tro Aso Va> He Met 

470 475 ■ 4 80 
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Lys Met Ser lie Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin 
485 .4 90 495 

Glu Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly -Phe Ala Pro Gly 
500 505 510 

Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr 
515 520 525 

Phe Asn Lys Lys lie Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr 
530 535 540 

Thr Asn Thr Lys Leu Leu Asa Ser His Leu Met Leu Lys Thr Leu Lys 
545 550 555 560 

Asn Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp 
565 570 ■ 575 

Asp Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val 
580 585- 590 

Glv Gin Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr 
595 600 605 

Aso Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
610 615 620 ■ 

Leu Leu Ser Ser Asp Phe Trp lie Met Thr Ala Gin lie Leu Lys Arg 
625 630 635 640 

Lys Asn Asn lie Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu. Met 
645 * 650 655 

Lys Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly 
660 665 670 

Gin Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu 
675 630 635 

Asn Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe 
690 * 695 " 700 

Lys Lys Asp Asn Leu Leu Gin Pro Val He Asn lie Cys Gin Tyr Asn 
705 / 710 715 720 

Tyr lie Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro 
725 730 735 

Gin Gly Leu Cys Val Ser Ser lie Leu Ser Ser Phe Tyr Tyr Ala Thr 
740 745 750 

Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro 
755 760 765 

Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu 
770 775 . 780 



Leu lie Thr Thr Gin Glu Asn- Asn Ala Val Leu Phe He Glu Lys Leu 
785 790 795 800 



83 



He Asn Val Ser Arg Glu Asn Giy Phe Lys Phe Asn Met Lys Lys Leu 
805 810 815 

Gin Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Giy Met 
320 - 825 830 

Asp Ser Val Glu Glu Gin Asn lie Val Gin Asp Tyr Cys Asp Trp lie 
835 840 845 

Giy lie Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn 
850 855 360 

Leu Arg He Glu Giy He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr 
365 870 875 880 

Lys Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met 
385 890 ' 895 

Asn Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe 
900 905 910 

Ala Asn Lys Thr Leu Asn Lys Leu Phe lie Ser Giy Giy Tyr Lys Tyr 
915 . 920 925 

Met Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lvs Asn Leu Ala 
930 935 * 940 

Met Ser Ser Met He Asp Leu Glu Val Ser Lys He He Tyr Ser Val 
945 950 955 960 

Thr Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He 
965 970 * 975 

Phe Giy Giu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His 
930 - 935 990 

Phe He Glu He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 " 1000 1005 

Met He Leu Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin 
1010 1015 1020 

Ser Leu He Gin Tyr Asp Ala 
1025 1030 



(2) INFORMATION TOR SEQ ID MO : 1 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DMA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

( B ) LOCATION: join (959 1216, 1273.. 1353, 1425.. 1543, 

1595. .1357, 1894. .2286, 2326. .2396, 2436. .2705, 
2746. .2862, 2914. ,3083, 3125. .3309, 3356. .3504, 
354 6. .3759, 3797. .4046, 4086. .4252, 4296. .4392, 
4435 . .4597) 

(D) OTHER INFORMATION: /note* " Schi zosaccharomyces pombe 

telomerase catalytic subunit (TRT) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

GGTACCGATT TACTTTCCTT TCTTCATAAG CTAATTGCTT CCTCGAACGC TCCTAAATCT 60 

CTGGAAATAT TTTTACAAGA ACTCAATAAC AATACCAAGT CAAATTCCAA TATGAAGGTG 120 

TTATTAGTGA TCGATAATAT TTCTATTTTA TCGGTCGTTA CCAAGTATAA GGACAAAAAG m 18 0 

AACAACTTCC TTCCCCCTAA AGACTTTTAC TTTATTAATT TACTTTTCAA ATATATTTCG 240 

GGTTCGCTTA CTTTTAATCG TGGTACTGTT TTAGCTGCTA CTTCTAGCCA ACCGCGTGTT 300 

TC7ACCCCGT CATTGGATAT AGCTCTTGGA GTAGCTCACA GAAATCCTTA CAAATCTTCT 3 60 

GATGAGACTA TATTAGATTC ATTACAGTCC GTGCATATTC TTAACATGGA GCCTTACACT 4 20 

TTAGATGAGT CACGTCGCAT GATGGAGTAT TTGGTATCAT CCAACGTTTG CCTTGAAAAG 4 80 

GTTGATAATT ATTTGCAAAA TCATGTCCTT AGTGGTGGTA AT CCGCG AAA GTTTTTTGAT 54 0 

GC77GCACAC GTCTAGCATG ATTGAGATAT TCAAAA-ATTT CTATCCACTA CAACTCCTTT 600 

AACGCGGTTT TATTTTTCTA TTTTCTATTC TCATGTTGTT CCAAATATGT ATCATCTCGT 6 60 

A77AGGCT7T TTTCCGTTTT ACTCCTGGAA TCGTACCTTT TTCACTATTC CCCCTAATGA -7 20 

A7AATCT AAA. TTAG7TTCGC 7TATAA.TTGA TAG TAG TAG A AAGATTGG7G ATTCTACTCG 7 80 

TG7AATGTTA 7TAGTTTAAA GATACTTTGC AAA.ACATTTA TTAGCTATCA T T A T A T AAAA 8 40 

AAAATCCTAT AATTATAAAT ATTAATCAAT ATTTGCGGTC ACTATTTATT TAAAACGTTA 900 

TGATCAGTAG GACACTTTGC ATATATATAG TTATGCTTAA TGGTTACTTG TAACTTGC 958 

ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC TTT CTA GAG 100 6 

Mei: Thr Glu His His Thr Pro Lys Ser Arg lie Leu Arg ?he Leu Glu 
1 5 10 15 

AAT CAA TAT GTA TAC CTA TGT ACC TTA AAT GAT TAT GTA CAA CTT GTT 10 5 4 

Asn Gin Tyr Val Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

TTG AG A GGG TCG CCG GCA AGC TCG TAT AGC AAT ATA TGC GAA CGC TTG 1102 
Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn lie Cys Glu Arg Leu 
35 40 45 

AG A AGC GAT GTA CAA ACG TCC TTT- TCT ATT TTT CTT CAT TCG ACT GTA 1150 
Arg Ser Asp Val Gin Thr Ser Phe Ser He Phe Leu His Ser Thr Val 
50 55 ' 60 
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GTC GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA TTT TCT TCT CCA 
Val Gly Phe Asp Ser Lys Pro Asp Glu Gly Val Gin Phe Ser Ser Pro 
65 70 75 80 

AAA TGC TCA CAG TCA GAG GTATATATAT TTTTGTTTTG ATTTTTTTCT 
Lvs Cys Ser Gin Ser Glu 
85 

ATTCGGGATA GCTAATATAT GGGCAG CTA ATA GCG AAT GTT GTA AAA CAG ATG 

Leu lie Ala Asn Val Val Lys Gin Met 
90 95 

TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA CTG ATG AAA GGG TTT 
Phe Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe 
100 105 110 ' 

TCC ATG GTAAGGTATT CTAATTGTGA AATATTTACC TGCAATTACT GTTTCAAAGA 
Ser Met 



GATTGTATTT AACCGATAAA G AAT CAT GAA GAT TTT CGA GCC ATG CAT GTA 

Asn His Glu Asp Phe Arg Ala Met His Val 
115 120 

AAC GGA GTA CAA AAT GAT CTC GTT TCT ACT TTT CCT AAT TAG CTT ATA 

Asn Gly Val Gin Asn Asp Leu Val Ser Th-r Phe Pro Asn Tyr Leu He 

125_ " 130 135 

TCT ATA CTT GAG TCA "AAA AAT TGG CAA CTT TTG TTA GAA AT 

Ser lie Leu Glu Ser Lys Asn Trp Gin Leu Leu Leu Glu He 
14 0 145 150 

GTAAATACCG GTTAAGATGT TGCGCACTTT GAACAAGACT GACAAGTATA G T ATC 

lie 



GGC AGT GAT GCC ATG CAT TAG TTA TTA TCC AAA GGA AGT ATT TTT GAG 

Gly Ser Asp Ala Met His Tyr Leu Leu Ser Lys Gly Ser He Phe Glu 

155 * 160 16S 170 

GOT CTT CCA AAT GAC AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT 

Ala Leu Pro Asn Asp Asn Tyr Leu Gin He Ser Gly He Pro Leu Phe 

175 180 135 

AAA AAT AAT GTG TTT GAG GAA ACT GTG TCA AAA AAA AG A AAG CGA ACC 

Lys Asn Asn Val Phe Glu Glu Thr Val Ser Lys Lys Arg Lys Arg Thr 
190 195 200 

ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA GAA GTT TCC 

He Glu Thr Ser He Thr Gin Asn Lys Ser Ala Arg Lys Glu Val Ser 

205 210 215 

TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC 

Tro Asn Ser He Ser He Ser Arg Phe Ser He Phe Tyr Arg Ser Ser 

220 225 230 

TAT AAG AAG TTT AAG CAA G GTAACTAATA CTGTTATCCT TCATAACTAA 

Tyr Lys Lys Phe Lys Gin 

235 " 240 
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TTTTAG AT CTA TAT TTT AAC TTA CAC TCT ATT TGT GAT CGG AAC AC A 193 4 

Asp Leu Tyr Phe Asn Leu His Ser lie Cys Asp Arg Asn Thr 
245 250 

GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG CAA TTT GGA CTT ATA 193 2 

Val His Met Trp Leu Gin Trp lie Phe Pro Arg Gin Phe Gly Leu He 
255 260 265 270 

AAC GCA TTT CAA GTG AAG CAA TTG CAC AAA GTG ATT CCA CTG GTA TCA 2030 
Asn Ala Phe Gin Val Lys Gin Leu His Lys Val He Pro Leu Val Ser 
275 230 285 

CAG AGT ACA GTT GTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT 207 8 

Gin Ser Thr Val Val Pro Lys Arg Leu Leu Lys Val Tyr Pro Leu He 
290 . 295 300 

GAA CAA ACA GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC 212 6 

Glu Gin Thr Ala Lys Arg Leu His Arg He Ser Leu Ser'Lys Val Tyr 
305 310 315 

AAC CAT TAT TGC CCA TAT ATT GAC ACC CAC GAT GAT GAA AAA ATC CTT 217 4 

Asn His Tyr Cys Pro Tyr He Asp Thr His Asp Asp Glu Lys He Leu 
320 325 330 

AGT TAT TCC TTA AAG CCG AAC CAG GTG TTT GCG TTT CTT CGA TCC ATT 2 222 

Ser Tyr Ser Leu Lys Pro Asn Gin Val Phe Ala Phe Leu Arg Ser He 
335 340 - 345 350 

CTT GTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA TTT 2270 
Leu Val Arg Val Phe Pro Lys Leu He Trp Gly Asn Gin Arc He Phe 
355 360 365 

GAG ATA ATA TTA AAA G GTATTGTATA AAATTTATTA CCACTAACGA TTTTA.CCAG AC 

2 327 

Glu lie He Leu Lys Asp 
370 

CTC GAA ACT TTC TTG AAA TTA TCG AG A TAC GAG TCT TTT AGT TTA CAT 2 37 5 

Leu Glu Thr Phe Leu' Lys Leu Ser Arg Tyr Glu Ser Phe Ser Leu His 
375 380 335 

TAT TTA ATG AGT AAC ATA AAG GTAATATGCC AAATTTTTTT ACCATTAATT 24 26 

Tyr Leu Met Ser Asn He Lys 
390 . 395 

AACAATCAG ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA AAA AGG TCA 24 7 4 

He Ser Glu He Glu Trp Leu Val Leu Gly Lys Arg Ser 
400 405 

AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT 2 5 22 

Asn Ala Lys LMet Cys Leu Ser Asp Phe Glu Lys Arg Lys Gin He Phe 
410 ' 415 420 

GCG GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA 2 5 70 

Ala Glu Phe He Tyr Trp Leu Tyr Asn Ser Phe He He Pro He Leu 
425 430 435 440 

CAA TCT TTT TTT TAT ATC ACT GAA' TCA AGT GAT TTA CGA AAT CGA ACT 2618 
Gin Ser Phe Phe Tyr He Thr Glu Ser Ser Asp Leu Arg Asn Arg Thr 
445 450 4 55 
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GTT TAT TTT AGA AAA GAT ATT TGG AAA CTC TTG TGC CGA CCC TTT ATT 2 666 

Val Tyr Phe Arg Lys Asp lie Trp Lys Leu Leu Cys Arg Pro Phe lie 
460 " 465 470 

ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA ATA AAC GAG GTATTTTAAA 27 15 

Thr Ser Met Lys Met Glu Ala Phe Glu Lys lie Asn Glu 
475 430 485 

GTATTTTTTG CAAAAAGCTA ATATTTTCAG AAC AAT GTT AGG ATG GAT ACT CAG 27 69 

Asn Asn Val .Arg Met Asp Thr Gin 
490 

AAA ACT ACT TTG CCT CCA' GCA GTT ATT" CGT CTA TTA CCT AAG AAG AAT 2817 
Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys Asn 
495 500 505 

ACC TTT CGT CTC ATT ACG AAT TTA AGA AAA AGA TTC TTA ATA AAG 2 8 62 

Thr Phe Arg Leu Tie Thr Asn Leu Arg Lys Arg Phe Leu lie Lys 
510 515 520 

GTATTAATTT TTGGTCATCA ATGTACTTTA CTTCTAATCT ATTATTAGCA G ATG GGT 2 919 

Met Gly 
525, 

' TCA AAC AAA. AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTG 2967 
Ser Asn Lys Lys Met Leu Val Ser Thr Asn Gin Thr Leu Arg Pro Val 
530 535 540 

GCA TCG ATA CTG AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA 3015 
Ala Ser He Leu Lys His Leu lie Asa Glu Glu Ser Ser Gly He Pro 
545 550 555 

TTT AAC TTG GAG GTT TAC ATG AAG CTT CTT ACT TTT AAG AA.G GAT CTT 30 63 

Phe Asn Leu Giu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu 
560 5-65 570 

CTT AAG CAC CGA ATG TTT GG GTAATTATAT AATGCGCGAT TCCTCATTAT 3113 
Leu Lys His Arg Met Phe Gly 
575 530 

TAATTTTGCA G G CGT AA.G AAG TAT TTT GTA'CGG'ATA GAT ATA AAA TCC 3161 
Arg Lys Lys Tyr Phe Val Arg He Asp He Lys Ser 
535 _ 590 

TGT TAT GAT CGA ATA AAG CAA- GAT TTG ATG TTT CGG ATT GTT AAA AAG 3209 
Cys Tyr Asp Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys Lys 
595 600 605 

AAA CTC AAG GAT CCC GAA TTT GTA ATT' CGA AAG TAT GCA ACC ATA CAT 32 5 7 

Lys Leu Lys Asp Pro Glu Phe Val He Arg Lys Tyr Ala Thr He His 
610 615 620 625 

GCA ACA AGT GAC CGA GCT ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC 3 30 5 

Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe Ser 
630 635 640 

TAT T GTA.AGTTT AT TTTTTCATTG GAATTTTTTA ACAAATTCTT TTT TAG TT 3 3 57 

Tyr Phe 
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GAT ATG GTG CCT TXT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA AC A 3405 
Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser Met Lys Thr 
645 650 655 

TCA GAT ACT TTG TTT GTT GAT TXT GTG GAT TAT TGG ACC AAA AGT TCT 34 53 

Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr Lys Ser Ser 
660 665 670 675 

TCT GAA ATT TTT AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT 3501 
Sec Glu lie Phe Lys Met Leu Lys Glu His Leu Ser Gly His lie Val 
680 685 690 

AAG GTATACCAAT TGTTGAATTG TAATAACACT AATGAAACTA G ATA GGA AAT 355 4 

Lys He Gly Asn 

695 

TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GGC TCA ATT CTG TCA 3602 
Ser Gin Tyr Leu Gin Lys Val Gly lie Pro Gin Gly Ser lie Leu Ser 
700 705 710 

TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA 3 650 

Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu 
715 720 725 

TCG TTT ACG AAA "AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT 3698 
Ser Phe Thr Lys Lys Lys Gly Ser Val Lea Leu Arg Val Val Asp "Asp 
730 " 735 740 

TTC CTC TTT ATA AC A GTT AAT AAA AAG GAT GCA AAA AAA XXX XXG AAX 37 4 6 

Phe Leu Phe lie Thr Val Asn Lys Lys Asp Ala Lys Lys Phe Leu Asn 
745 750 * 755 

TTA TCT TTA AG A G GTGAGTTGCT GTCATTCCTA AGTTCTAA.CC GTTGAAG GA 37 9 8 

Leu Ser Leu Arg Gly 

■■ *c j 

TTT GAG AAA CAC AAT TTT TCT ACG AGC CTG GAG AAA ACA GTA ATA AAC 33 4 6 

Phe Glu Lys His Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie Asn 

765 ' 770 775 - 780 

TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC 38 94 

Phe Glu Asn Ser Asn Gly lie He Asn Asn Thr Phe Phe Asn Glu Ser 
785 790 795 

AAG AAA AG A ATG CCA TTC TTC GGT TTC TCT GTG AAC ATG AGG TCT CTT 39 4 2 

Lys Lys Arg Met Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu 
800 * 805 810 

GAT ACA TTG TTA GCA TGT CCT AAA ATT GAT GAA GCC TTA TTT AA.C, TCT 3 9 90 

Asp Thr Leu Leu Ala Cys Pro Lys lie Asp Glu Ala Leu Phe Asn Ser 
815 820 825 

ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG AAA TCT TTT TTT TAC AAA 4038 
Thr Ser Val Glu Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys 
830 835 840 

ATT CTA AG GTATACTGTG T AA.C T G AAT A AT AG CT GAC A AAT AAT CAG A TCG 4 08 9 

Xle Leu Arg Ser 

845 
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AGC CTT GCA TCC TTT GCA CAA GTA TTT ATT GAC ATT ACC CAC AAT TCA 4 137 

Ser Leu Ala Ser Phe Ala Gin Val Phe lie Asp He Thr His Asn Ser 
850 855 860 

AAA TTC AAT TCT TGC TGC AAT ATA TAT -AGG CTA GGA TAG TCT ATG TGT 4 18 5 

Lys Phe Asa Ser Cys Cys Asn He Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

ATG AG A GCA CAA GCA TAC TTA AAA AGG ATG AAG GAT ATA TTT ATT CCC 4 23 3 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Met Lys Asp He. Phe He Pro 
885 890 895 

CAA AGA ATG TTC ATA ACG G GTGAGTACTT ATTTTAACTA GAAAAGTCAT 4 28 2 

Gin Arg Met Phe ■ He Thr 
900 

TAATTAACCT TAG AT. CTT TTG AAT GTT ATT GGA AGA AAA ATT TGG AAA ■ 4 330 

Asp- Leu Leu Asn Val He Gly Arg Lys He Trp Lys 
905 ' 910 

AAG TTG GCC GAA ATA TTA GGA TAT ACG AGT AGG CGT TTC TTG TCC TCT 4 37 8 

Lys Leu Ala Glu He Leu Gly Tyr Thr Ser Arg Arg Phe Leu Ser Ser 
915 920 925 930 ' 

GCA GAA GTC AAA TG GTACGTGTCG GTCTCGAGAC TTCAGCAATA TTGACACATC 4 4 32 

Ala Glu Val Lys Trp 
935 

AG G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA CCC TCT TTC AAA 4 4 30 

Leu Phe . Cys Leu Gly Met Arg Asp Gly .Leu Lys^ Pro Ser Phe Lys 
940 945 1 950 

TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4 5 28 

Tyr His Pro Cys Phe Glu Gin Leu lie Tyr Gin Phe Gin Ser Leu Thr 
955 - 950 965 

"GAT CTT ATC AAG CCG CTA AGA CCA GTT TTG CGA CAG GTG TTA TTT TTA 4 576 

Asp Leu He Lys Pro Leu Arg Pro Val Leu Arg Gin Val Leu Phe Leu 
970 975 - 930 

CAT AGA AGA ATA GCT GAT TAATGTCATT TTCAATTTAT TAT AT AC ATC 4 624 
Kis Arg Arg He Ala Asp 
93 5 

CTTT ATTACT GGTGTCTTAA ACAATATTAT TACTAAGTAT AGCTGACCCC CAAAGCAAGC 4 68 4 

AT AC TAT AGG ATTTCTAGTA AAGTAAAATT AATCTCGTTA TTAGTTTTGA TTGACTTGTC 4744 

TTTATCCTTA TACTTTTAAG AAAGATTGAC AGTGGTTGCT GACTACTGCC CACATGCCCA 4 804 

TT AAACGGGA GTGGTTAAAC ATTAAAAGTA AT AC ATG AGG CTAATCTCCT TTCATTTAGA 4 864 

ATAAGGAAAG TGGTTTTCTA TAATGAATAA TGCCCGCACT AATGCAAAAA G ACG AAG ATT 4 924 

ATCTTCTAAA CAAGGGGGAT TAAGCATATC CGAAGGAAAA GAGAGTAATA TACCCAGTGT 4 93 4 

TGTTGAAGAA- AGCAAGGATA ATTTGGAACA AGCTTCTGCA GATGACAGGC TAAATTTTGG 304 4 

TGACCGAATT TTGGTAAAAG CCCCAGGTTA TCCATGGTGG CCGGCCTTGC TACTGAGACG 510 4 

AAAAGAAACT AAGGATAGTT TGAATACTAA TAGCTCATTT AATGTCTTAT ATAAGGTTTT 5164 
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GTTTTTTCCT GACTTCAATT TTGCATGGGT GAAAAGAAAT AGTGTTAAGC CATTATTGGA 522 4 

TTCCGAAATA GCCAAATTTC TTGGTTCCTC AAAGCGGAAG TCTAAAGAAC TTATTGAAGC 528 4 

TTATGAGGCT TCAAAAACTC CTCCTGATTT AAAGGAGGAA TCTTCCACCG ATGAGGAAAT 5 34 4 

GGATAGCTTA TCAGCTGGTG AGGAGAAGCC TAATTTTTTG CAAAAAAGAA AATATCATTG 5 4 04 

GGAGACATCT CTTGATGAAT CAGATGCGGA GAGTATCTCC AGCGGATCCT TGATGTCAAT 54 64 

AACTTCTATT TCTGAAATGT ATGGTCCTAC TGTCGCTTCG ACTTCTCGTA GCTCTACGCA 5 52 4 

GTTAAGTGAC CAAAGGTACC 554 4 

(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 988 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear ■ 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

Met Thr Glu His His Thr Pro Lys Ser Arg He Leu Arg Phe Leu Glu 
1 -5 .10 15 

Asn Gin Tyr Vai Tyr Leu Cys Thr Leu Asn Asp Tyr Val Gin Leu Val 
20 25 30 

Leu Arg Gly Ser Pro Ala Ser Ser Tyr Ser Asn He Cys Glu Arg Leu 
35 40 ' 45 

Arg Ser Asp Val Gin Thr Ser Phe Ser lie Phe Leu His Ser Thr Vai 
50 55 60 

Vai Gly Phe Asp Ser Lys Pro Aso Glu Gly Val Gin Phe' Ser Ser Pro 
65 70 75 80 

Lvs Cys Ser Gin Ser Glu Leu He Ala Asn Val Val Lys Gin Met Phe 
85 90 95 

Asp Glu Ser Phe Glu Arg Arg Arg Asn Leu Leu Met Lys Gly Phe Ser 
100 105 110 

Met Asn His Glu Asp Phe Arg Ala Met His Val Asn Gly Val Gin Asn 
115 12.0 ' 125 

Asp Leu Val Ser Thr Phe Pro Asn Tyr Leu He Ser lie Leu Glu Ser 
130 135 140 

Lys Asn Trp Gin Leu Leu Leu Glu lie lie Gly' Ser Asp Ala Met His 
145 150 155 160 

Tvr Leu Leu Ser Lys Gly Ser He Phe Glu Ala Leu Pro Asn Asp Asn 
165 170 175 

Tyr Leu Gin He Ser Gly He Pro Leu Phe Lys Asn Asn Val Phe Glu 
180 185 190 
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Giu Thr Vai Ser Lys Lys Arg Lys Arg Thr He Giu Thr Ser lie Thr 
195 200 205 

Gin Asn Lys Ser Ala Arg Lys Giu Vai Ser Trp Asn Ser He Ser He 
210 .215 * 220 

Ser Arg Phe Ser He Phe Tyr Arg Ser Ser Tyr Lys Lys Phe Lys Gin 
225 230 235 240 

Asp Leu Tyr Phe Asn Leu His Ser He Cys Asp Arg. Asn Thr Vai His 
245 250 . 255 

Met Trp Leu Gin Trp He Phe Pro Arg Gin Phe Gly Leu He Asn Ala 
260 265 270 

Phe Gin Vai Lys Gin Leu His Lys Vai lie Pro Leu Vai Ser Gin Ser 
275 280 285 

Thr Vai Vai Pro Lys Arg Leu Leu Lys Vai Tyr Pro Leu He Giu Gin 
290 ■ 295 300 

Thr Ala Lys Arg Leu His Arg He Ser Leu Ser Lys Vai Tyr Asn His 
305 310 315 320 

Tyr Cys Pro Tyr He Asp Thr His Asp Asp Giu Lys He Leu Ser Tyr 
325 3 JO 335 

Ser Leu Lys Pro Asn Gin Vai Phe Ala Phe Leu Arg Ser He Leu Vai 
340 345 . 350 

Arc Vai Phe Pro Lys Leu He Trp Gly Asn Gin Arg He Phe Giu lie 
335 360 365 

He Leu Lys Asp Leu Giu Thr Phe Leu Lys- Leu Ser Arg Tyr Giu Ser 
370 - " 375" 330 

~?he Ser Leu His Tyr Leu Met Ser Asn lie Lys He Ser Giu He Giu 
335 390 395 400 

Trp Leu Vai Leu Gly Lys Arg Ser Asn Ala Lys Met Cys Leu Ser Asp 
4 05 4 10 4 15 

Phe Giu Lys Arg Lys Gin He Phe Ala Giu Phe lie Tyr Trp Leu Tyr 
420 425 430 

Asn Ser Phe He He Pro lie Leu Gin Ser Phe Phe Tyr He Thr Giu 
435 440 445 

Ser Ser Asp Leu Arg Asn Arg Thr Vai Tyr Phe Arg Lys Asp He Trp 
450 ~ 455 460 

Lys Leu Leu Cys Arg Pro Phe lie Thr Ser Met Lys Met Giu Ala Phe 
4 55 4 70 4 75 4 30 



Giu Lys He Asn Giu Ash Asn Vai Arg Met Asp Thr Gin Lys Thr Thr 
485 490 495 



Leu Pro Pro Ala Vai lie Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg 
500 505 510 
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Leu He Thr Asn Leu Arg Lys Arg Phe Leu He Lys Met Gly Ser Asn 
515 520 525 

Lys Lys Met Leu Val Ser Thr Asa Gin Thr Leu Arg Pro Val Ala Ser 
530 535 540 

He Leu Lys His Leu He Asn Glu Glu Ser Ser Gly He Pro Phe Asn 
545 550 555 560 

Leu Glu Val Tyr Met Lys Leu Leu Thr Phe Lys Lys Asp Leu Leu Lys 
565 570 575 

His Arg Met Phe Gly Arg Lys Lys Tyr Phe Val Arg lie Asp He Lys 
580 585 590 

Ser Cys Tyr Asp Arg He Lys Gin Asp Leu Met Phe Arg He Val Lys 
'595 . 600 605 

Lys Lys Leu Lys Asp Pro Glu Phe Val He Arg Lys Tyr Ala Thr lie 
610 615 620 

His Ala Thr Ser Asp Arg Ala Thr Lys Asn Phe Val Ser Glu Ala Phe 
625 630 635 640 

Ser Tyr Phe Asp Met Val Pro Phe Glu Lys Val Val Gin Leu Leu Ser 
645 656 655 

Met Lys Thr Ser Asp Thr Leu Phe Val Asp Phe Val Asp Tyr Trp Thr 
660 665 ' 670 " . 

Lys Ser Ser Ser Glu He Phe Lys Met Leu Lys Glu His Leu Ser Gly 
675 ' 630 635 

His He Val Lys He Gly Asn Ser Gin Tyr Leu Gin Lys Val Gly He 
690 695 700 

Pro Gin Gly Ser lie Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
705 * 710 715 ^ 720 

Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val 
725 730 " 735- 

Leu Leu Arg Val Val Asp Asp Phe Leu Phe He Thr Val Asn Lys Lys 
740 . 745 750 

Asp Ala Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys His 
755 760 765 

Asn Phe Ser Thr Ser Leu Glu Lys Thr Val lie Asn Phe Glu Asn Ser 
770 775 780 

Asn Gly He lie Asn Asn Thr Phe Phe Asn Glu Ser Lys Lys Arg Met 
735 ' • 790 795 800 

Pro Phe Phe Gly Phe Ser Val Asn Met Arg Ser Leu Asp Thr Leu Leu 
805 1 810 815 

Ala Cys Pro Lys He Asp Glu Ala Leu Phe Asn Ser Thr Ser Val Glu 
820 325 . 830 
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Leu Thr Lys His Met Gly Lys Ser Phe Phe Tyr Lys lie Leu Arg Ser 
835 840 845 

Ssr Leu Ala Ser Phe Ala Gin Val Phe lie Asp He Thr His Asn Ser 
. 850 855 - 860 

Lys Phe Asn Ser Cys Cys Asn lie Tyr Arg Leu Gly Tyr Ser Met Cys 
865 870 875 880 

Met Arg Ala Gin Ala Tyr Leu Lys Arg Mat Lys Asp lie Phe He Pro 
885 890 895 

Gin Arg Met Phe He Thr Asp Leu Leu Asn Val He Gly Arg Lys He 
900 905 910 

Trp Lys Lys Leu Ala Glu lie Leu Gly Tyr Thr Ser Arg Arg Phe Leu 
915 920 925 

Ser Ser Ala Glu Val Lys Trp Leu Phe Cys Leu Gly Met Arg Asp Gly 
930 935 940 



Leu Lys Pro Ser Phe Lys Tyr His 
945 950 

Gin Phe Gin Ser Leu Thr Asp Leu 
965 

Arg Gin Val Leu Phe Leu His Arg 
980 



Pro Cys Phe Glu Gin Leu Tie Tyr 

955 960 

He Lys Pro Leu Arg Pro Val Leu 

979 9^5 

Arg He Ala Asp 
985 



(2) INFORMATION FOR SEQ ID MO: 113: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDMESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 



Cxi) SEQUENCE DESCRIPTION: SEQ 10 NO: 113: 
CAAAACCCCA AAACC ' 15 



(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 16 base pairs 
{3) TYPE: nucleic acid 
(C) STRANDEDMESS : single 
(0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 11 4 
GGGGTTTTGG GGTTTT 



16 
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(2) INFORMATION FOR SEQ ID NO: 115: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CO) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: ■ 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 1762 

'(D) OTHER INFORMATION : /note= "gene encoding the 43 kDa 

telomerase protein subunit from 
Euplotes aediculatus" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:115: 

"CCCCAAAACC .CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG GTAGTTTAGA 60 

AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA TTTGGATGAT ATAGAAAATT 120 

TACTTCCTAA TACATTCAAC AAGTATAGCA GCTCTTGTAG T G AC AA G AAA GGATGCAAAA" 13 0 

CATTGAAATG TGGCTC.GAAA TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG 24 0 

AGTTCTACTT CTCGGATGCA AATCTTTATA ACGATTCTTT C T T G AG AAAA TTAGTTTTAA 30 0 

AAAGCGGAGA G C AAA.G AG T A GAAATTGAAA CATTACTAAT GTTTAAATAA AATCAGGTAA .360 

TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA T T AT GG AG AA AATTACTTAA 4 20 

TACTAAAAGG T..AAACAGTTT GGATTATTTC CCTAGCCAAC .AATGATGAGT ATATTAAATT. '430 

CATA.TGAGAA TGAGTC AAA.G GATCTCGATA CATCAGACTT AC C AAA. G A C A AACTCGCTAT 54 0 

AAAACGCAAG AAAAA.G T T T G ATAATCGAAC AG C AG AA.G AA CTTATTGCAT TT ACTATTCG 600 

TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC TTGAGACAAT 660 

T G AAAAAG C T GTTTACAACT GAAGGAATCG CAGTTCTGAA AGTTCTGATG TGTATGCCAT 7 20 

TATTTTGTGA ATTAATCTCA AATATCTTAT CTCAATTTAA TGGATAGCTA T AG AAAC AAA 78 0 

CCAAATAAAC CATGCAAGTT TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC 840 

TGAATTTATA TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 900 

TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT " AAAAGAAGCA 960 

GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT TTGTTGATTC TTCTGTAACC 1020 

G G AAT T AA.C A ACAAGAATAT TAGCAACGAA AAAGAAGAAG AGCTATCACA ATCCTGATTC 1030 

TTAAAGATTT C AAAAAT T C C AGGTAAGAGA GAT AC AT TC A TTAAAATTCA TATATTATAG 114 0 

TTTTTCATTT CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 1200 
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GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT TCACATTCAT 12 60 

AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA GCAGTCATCC GTTTTAAAAA 132 0 

TAGTGCTATG AGGACTAAAT TTTTAGAGTC AAGAAATGGA GCCGAAATCT TAATCAAAAA 138 0 

GAATTGCGTC GATATTGCAA AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA 14 4 0 

ATCTTGATTG ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 1500 

GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG CG AT CTTCAA 15 60 

TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA ATACAAACCT TGGTCAAAAT 1620 
ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA AAAGAAAAAA TAAGGCAATA AATAAAATGA • 1680 

GTACAGAAGT GAAGAAATAA AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAG'GGGTT 17 4 0 

TTGGGGTTTT GGGGTTTTGG GG 17 62 



(2) INFORMATION FOR SEQ ID NO: 116: 

( i ) SEQUENCE 'CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 
_(C) STRANDEDNESS : 
(0} TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID MO: 116: 

Pro Gin Asn Pro Lys Thr Pro Lys Pro Leu 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 117: 

Ci) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDMESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 117: 



Lys Lys Lys Lys Leu Arg 
1 * 5 
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(2) INFORMATION FOR SEQ ID NO: 118: 

(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Phe Arg Asn Lys lie Leu Phe Pro His Lys Trp Arg Trp He Leu lie 
1 5 10 15 

Trp Met' lie 



(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE.: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 119: 

Lys lie Tyr Phe Leu lie His Ser Thr Ser He Ala Ala Leu Val Val 

r 5 10 15 

Thr Arg Lys Asp Ala Lys His Cys Asn Leu Ala Arg Asn Arg Leu His 

20 25 30 

Cys Leu Phe Gin Ser Cys Lys Asn Asn 
35 40 



(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 

Ser Ser Thr Ser Arg Met Gin lie Phe lie Thr lie Leu Ser Cys Glu 
15 10 15 



Asn 
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(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE. DESCRIPTION: SEQ ID NO: 121 

Lys Ala Glu Ser Lys Glu 
1 5 



(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122 

Lys Leu Lys His Tyr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 12 3: - 

(i) "SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY •: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 

Cys Leu Asn Lys lie Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 

Cys Gly Leu Phe Tyr Phe Lea Asp His Phe Leu Arg Ser He Met Glu 
1 5 10 15 . 

Lys He Thr 



(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:125: 

Lys Val Asn Ser Leu Asp Tyr Phe Pro Ser Gin Gin Cys Cys Val Tyr 
1 5 "10 15 



(2) INFORMATION FOR SEQ ID NO: 126: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126: 

He His Met Arg Met Ser Gin Arg He Ser He His Gin Thr Tyr Gin 
1 5 10 15 

Arg Gin Thr Arg Tyr Lys Thr Gin Glu Lys Val Cys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 56 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127: 



Ser Asn Ser Arg Arg Thr Tyr Cys He Tyr Tyr Ser Tyr Gly Phe Tyr 
1 5 10 15 
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Tyr Asn Cys Phe Arg Tyr Arg Arg Cys Thr Pro Glu Ser Cys Asp Asn 
20 . 25 30 

Cys Lys Ser Cys Leu Gin Leu- Lys Glu Ser Gin Phe Cys Lys Phe Cys 
35 40 - 45 



SEQUENCE DESCRIPTION: SEQ ID NO: 128: 

Gin lie Ser Tyr Leu Asn Leu Met Asp Ser Tyr Arg Asn Lys Pro 
5 10 15 



Cys Val Cys His Tyr Phe Val Asn 
50 55 



INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 134 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) 

Ser 

" 1 

Asn Lys Pro Cys- Lys Phe Asn Gly 
20 

Asn Ala His Cys lie Tyr He Gly 
35 40 

Cys Phe Arg Asp Cys 
50 

Ser Cys Ser Ser Leu 



Arg Leu Lys Lys Glu He Ser Lys 
85 

He Asn Asn Lys Asn He Ser Asn 
100 

Ser Cys Phe Leu Lys He Ser Lys 

115 120 



He Tyr Val Lys Ser Phe Gly Thr 
25 30 

Phe- Leu Lys His Arg Tyr Thr Glu 
45 



Phe Val Asp Ser Ser Val Thr Gly 
90 95 

Glu Lys Glu Glu Glu Leu Ser Gin 
105 _ 110 

lie Pro Gly Lys Arg Asp Thr Phe 
125 



Phe Ser Leu Gin Gin He Thr Cys Phe Asp Tyr 
55 -60 

He Ser Leu Lys Glu Ala Gly Glu Met Lys Arg 
7 0 7 5 3 0 



lie Lys He His He Leu 
130 



INFORMATION FOR SEQ ID MO: 129:. 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acicis 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129; 

Phe Phe He Ser Gin Leu Leu Phe Ser Phe He Leu Thr He Phe Phe 
1 5 10 15 

Asp 



(2) INFORMATION FOR SEQ ID NO: 130: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130: 

Leu Glu Val Lys Ser He Lys 

1 • 5 



(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

Thr Glu Val Thr 
1 



(2) INFORMATION FOR SEQ ID MO: 13.2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

Leu He His He His Arg Ser Thr Phe He Tyr Pro lie Arg Cys 
1 5 10 15 
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(2} INFORMATION FOR SEQ ID MO: 133: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3 ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

Giy Asn Ser Ser His Pro Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH': 4 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: ' 

(D) TOPOLOGY: ' linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION-: SEQ ID NO: 134: 
Cys Tyr Glu Asp 



INFORMATION FOR SEQ ID NO: 135: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids' 
(3) TYPE*, amino acid 
(C) ' STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135: 

lie Phe Arg Val Lys Lys Trp Ser Arg Asn Leu Asn Gin Lys Glu Leu 
1 5 10 15 

Arg Arg Tyr Cys Lys Arg He Glu Leu 
20 25 



(2) INFORMATION FOR SEQ ID NO: 136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS: 

{ D ) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 136: 

Val Leu Pro He Leu lie Asp Cys Arg Asp Cys Arg Gly Asn Cys Thr 
1 5 10 15 

Glu Asp His 



(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
(3) TYPE: amino acid 
"(C) ST RAND E D.NESS : 
(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 

Arg Asn Lys Val Thr Phe He Asn 
I ' 5 



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138: 

He Thr Asn He Glu He Ser Asp Leu Gin Leu Thr Lys 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 139: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 



Thr He Lys Asn Thr Asn Leu Gly Gin Asn lie Glu Glu- Gly Lys Glu 
1-5 10 15 
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Asp Gin Leu Ala Lys Glu Lys lie Arg Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO:140: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

lie Lys Cys ValGln Lys Cys- Arg Asn Lys Arg Phe lie Phe Phe Asn 
1 5 io is 

Asn Lea Leu Lys Arg Gly Val Leu Giy Phe Trp Gly Phe Gly 
20 25 30 



(2) INFORMATION" FOR SEQ ID NO: 14 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID MO : 1 4 I : 

Pro Lys Thr Pro Lys Pro Gin Asr. Pro Tyr Lys Lys Arg Lys Asn Cys 

J- 5 10 - 15 

Gly Ser Leu Glu lie Lys Tyr Tyr Ser Arg Thr Asn Gly Aso Gly ~yr 
20 25 30' 

Cys Phe Gly Cys Tyr Arg Lys Phe Thr Ser 

35 40 



(2} INFORMATION EOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS : 
(AJ LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142: 

Tyr lie Gin Gin Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 14 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

Cys Gin Glu Arg Met Gin Asn lie Glu He Trp Leu Glu He Ala Phe 

1 5 10 15 

lie As? Tyr Ser Lys Val Ala Lys Thr He Arg Val Leu Leu Leu Gly 
20 25 30 

Cys Lys Ser Leu 

35 - 



[2: INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 4 4 : ' 

Arg Phe Phe Leu Glu Lys He Ser Phe Lys Lys Arg Arg Ala Lvs Ser 
1 5 10 15 

Arg Asn Cys Asn He Thr Asn Val 
20 



2) INFORMATION FOR SEQ ID NO: 145: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 1 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) lMOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 145: 

lie Lys Ser Gly Asn Glu Asp Tyr Ser lie Phe 

1 5 .10 

(2) INFORMATION If OR SEQ ID MO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID HO: 146: 

Gly Ala Leu Trp Arg Lys Lea Leu Asn Thr Lys Arg 
I 5 10 



(2) INFORMATION FOR SEQ ID NO : 1 4 7 : 

(L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oeotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 7: 

Thr Val Trp He lie Ser Leu Ala Asn Asn Asp GIu Tyr lie Lys Phe 
1 5 10 ^ 15 

He Cys Glu Cys Val Lys Gly Ser Arg Tyr He Arg Leu Thr Lys Asp 
20 25 30 

Lys Leu Ala He Lys Arg Lys Lys Lys Phe Asp Asn Arg Thr Ala Glu 
35 40 45 

Glu Leu He Ala Phe Thr He Arg Met Gly Phe He Thr He Val Leu 

50 55 60 

Gly He Asp Gly Glu Leu Pro Ser Leu Glu Thr He Glu Lys Ala Val 
65 "70 75 30 

Tyr Asn Cys Arg Asn Arg Ser Ser Glu Ser Ser Asp Val Tyr Ala lie 
85 90 - 95 



lie Leu Cys lie Asn Leu Lys Tyr Leu 
100 105 



lie 



Ser He 



106 



(2) INFORMATION FOR SEQ ID NO: 14 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

CD) TOPOLOGY: linear 

(ii) MOLECULE" TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148: 

Trp He Ala He Glu Thr Asn Gin He Asn His Ala Ser Leu Met Glu 
1 • 5 10 15 ' 

Tyr The Leu Asn Pro Leu Gly Gin Met His Thr Glu Phe He Leu Asp 
20 25 30 

Ser 



(2) INFORMATION FOR SEQ ID NO: 14 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 29 amino acids 
■(B) TYPE: amino acid 
(C) STRANDEDNESS: 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

Ser lie Asp Thr Gin Asn Ala. Leu Glu Thr Asp Leu Ala Tyr Asn Arg 
1 5 10 * is 

Leu Pro Val Leu lie Thr Leu Ala His Leu Leu Tyr Lai: 
20 25 



(2) -INFORMATION FOR SEQ ID MO: 150: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

.'(ii) MOLECULE TYPE: peptide 



( x i ) SEQUENCE DESCRIPTION: SEQ ID NO:150: 



Lys Lys Gin Ala Lys Cys Lys Glu Aso 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 151: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:151: 

Arg Lys Arg Phe Gin Asn Leu Leu He Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO : 15 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

<:<i) SEQUENCE DESCRIPTION: ~S£Q ID MO : 15 2: 
Pro Glu Leu Thr Thr Arg tie Leu Ala The Lys Lys Lys Lys Ser Tyr 

- 1 5" 10 

His As n Pro As? Ser 
20 

i2) INFORMATION FOR -SEQ 10 MO : 153 : 

(i) SEQUENCE CHARACTERISTICS: 

"(A) LENGTH: 30 amino acids- 
(3) TYPE : amino .acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION: SEQ IDNO:lS3: 

Pft. Ola Lv, Phe Gin v.! G!u II. Ki, S« L.U Ly. Jh. II. 

1 5 

Tyr xyr Ser Phe Ser Phe His Ser Cys Tyr =he Leu Leu Ser 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 8 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 154: 

Gin Tyr Phe Leu He Ser Trp Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO : 1 55 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) "TYPE: amino_acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155: 

Lys Val Ser Asn Lys Arg Ser Ala Arg Leu Arg 
1 5 10 



INFORMATION FOR SEQ ID NO: 156: 

(i) SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 37 amino acids 
(3) TYPE: arriino acid 
■(C) STRANDEDNESS: 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



Ui) SEQUENCE DESCRIPTION: SEQ ID MO 

Leu Ser Leu Phe Thr Phe He Asp Arg 
1 5 



: 156: 

Pro Ser Tyr lie Gin Tyr Asp 
10 15 



Asp Lys Glu Thr Ala Val He Arg Phe Lys Asn Ser Ala Met Arg Thr 



20 



25 



30 



Lys Phe Leu Glu Ser Arg Asn Gly Ala Glu He Leu He Lys Lys Asn 
3 5 4 0 4 5 



Cys Val Asp He Ala Lys Glu Ser Asn Ser Lys Ser Phe Val Asn Lys 



50 



55 



60 



Tyr Tyr Gin Ser Cys Leu lie Glu Glu lie Asp Glu Ala Thr Ala Gin 



65 



70 



75 



80 
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Lys He He Lys Glu He Lys 
85 



(2^ INFORMATION FOR _SEQ ID NO:157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 



Leu Leu Leu He Arg Glu 
1 5 

(2) INFORMATION FOR SEQ ID MO: 153: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 158: 

Thr Lvs Leu Leu lie 

1 5 - 

(2) INFORMATION FOR SEQ ID NO: 159: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 159: 

Arg S*r Ala lie Phe Asn Cys Arg Asn Lys Ser Cys Thr Lys Va 1 Arg 
i 5 10 15 



Gin 
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(2) INFORMATION FOR SEQ ID NO: 160: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 160: 

Lys lie Gin Thr Leu Val Lys lie Leu Arg Lys Glu Lys Lys Thr Ser 
1 5 . 10 "15 



(2) INFORMATION FOR SEQ ID NO : 161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

Gin Lys Lys Lys 
1 



INFORMATION FOR SEQ ID MO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 amino acids 
: (S) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(iij MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 

Asp. Glu Tyr Arg Ser Glu Glu He Lys Asp Leu Phe Phe Ser He He 
1 5 10 15 

Tyr Cys Lys Glu Gly Phe "Trp Gly Phe Gly Val Leu Gly 
20 25 



(2) I M FORMAT ION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



1 > I 



(ii) MOLECULE TYPE: peptide 



(Ki) SEQUENCE DESCRIPTION: SEQ ID MO: 163:. 

Pro Lys Pro Gin Asn Pro Lys Thr Pro lie Lys Lys Glu Lys lie Glu 
1 5 10 15 

Val Val 



(2) INFORMATION FOR SEQ ID NO: 164: 

(i) SEQUENCE CHARACTERISTICS: 

(A). LENGTH: 94 amino acids 
.(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 

Asn He He Pro Ala Gin Mec Glu Met Asp He Asp Leu Asp As? He 
1 5 -10 " 15 ' 

Glu Asn Leu Leu Pro Asn "Thr Phe Asn Lys Tyr Ser Ser Ser Cys Ser 
'20 25 30 

Asp Lys Lys Gly Cys Lys Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser 
35 40 45 



,eu Thr He Pro Lys Leu Gin Lys Gin Leu Glu ?he Tyr Phe Ser Asp 

50 35 60 - 



Ala Asn Leu Tyr Asn -Asp 
h5 ' 7 0-" 



Phe leu Arg Lys Leu Val Leu Lys Ser 
75 30 



Gly Glu Gin Arg Val Glu lie Glu Thr Leu Leu Met Phe Lys 
35 " 90 



(2) INFORMATION FOR SEQ ID MO: 165: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(3) TYPE: amino acid 

(C) STRAMDEDNESS: 

(0) TOPOLOGY: linear ■ 



(ii) MOLECULE TYPE: peptide 



(:•;!) SEQUENCE DESCRIPTION: SEQ ID MO: 165: 

Asn Gin Val Met Arg He lie Leu Phe Phe Arg Ser Leu Leu Lys Glu 
1 5 10 15 
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His Tyr Gly Glu Asn Tyr Leu lie Leu Lys Gly Lys Gin Phe Gly Leu 
20 25 30. 

Phe Pro 



2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : . pept ide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 166: 

Pro Thr Met Met Ser lie Leu Asn Ser Tyr Glu Asn Glu Ser Lys Asp 
L 5 10 15 

Leu Asp Thr Ser Asp Leu Pro Lys Thr Asn Ser Leu 
20 2 5 



2) INFORMATION FOR SEQ ID NO: 167: 

Ci) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

■ii} MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 

Asn Ala Arg Lys Ser Leu lie He Glu Gin Gin Lys Asn Leu Leu His 
1 5 '10 15- 

Leu_ Leu Phe Val Trp Val Leu Leu Gin Leu Phe 
20 " 25 



2) INFORMATION FOR SEQ ID NO: 163: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 6 amino acids 
{8) TYPE: amino acid 
(C) STRANDEDNESS: 
■ (D) TOPOLOGY: linear' 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 168: 



Val 
1 



Ser Thr Val Asn Ser Arg Val Leu Arg Gin Leu Lys Lys Leu Phe 
■5 10 15 
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Thr Thr Glu Gly lie Ala Val Leu Lys Val Leu Met Cys Met Pro Leu 

20 25 30 

Phe Cys Glu Leu lie Ser Asn lie Leu Ser Gin Phe Asn Gly 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS : 

(A) - LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 169: 

Lys Gin Thr Lys 
1 



INFORMATION FOR SEQ ID MO: 170: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE:~amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 170: 

Thr Mer Gin Val 

1 - 



(2; INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 4 amino acids 
{ 3) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

T rp Asn lie Arg 

1 
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(2) IN FORMAT ION FOR SEQ ID NO: 172: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO:172; 



lie Leu Trp Asp Lys Cys Thr Leu Asa Leu Tyr Trp lie Leu Lys Ala 
1 5 . 10 ' 15 



(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(8) TYPE: amino acid 
(C) STRANDEDNESS: 
CD) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID MO: 173: 

lie His Arg Met Leu 

1 5 



INFORMATION FOR SEQ ID NO: 174: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 
T3) TYPE: amino acid 
. (C) STRANDEDNESS: 
(0) TOPOLOGY: linear' 

(ii) .MOLECULE TYPE: peptide 



( xi ) SEQUENCE DESCRIPTION: SEQ ID MO:174: 

Leu Thr Thr Asp Tyr Leu Phe Cys Leu Leu Leu Leu lie Ser Tyr lie 

15 10 15 

Phe Lys Arg Ser Arg Arg Asn Glu Lys Lys Thr Lys Giu Arg Asp Phe 

20 25 30 

Lys lie Cys Cys Phe Phe Cys Asa Arg Asn 
35 40 
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(2) INFORMATION FOR SEQ ID NO: 175: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE ; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175: 

Gin Gin Glu Tyr 
1 



(21 INFORMATION FOR SEQ ID NO:176: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

( D ) TOPOLOGY: linear 

.(ii) MOLECULE TYPE : 'peptide 



(xi) SZQUEblCZ DESCRIPTION: SEQ ID NO: 176: 

-Gin Arg Lys Arg Arg Arg Ala He Thr He Leu He Leu Lys Asp Phe 
5 10 15 

Lys Asn Ser Arg 
20 



INFORMATION FOR SEQ ID' HO: 177: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
£3) TYPE: amino acid 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(;<i) SEQUENCE DESCRIPTION: SEQ ID NO: 177: 

Glu Arg Tyr He His 
1 5 



[2] INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 5 amino acids 
(3) TYPE; amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 
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Cii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID MO: 173: 

Asn Ser Tyr He He Val Phe His Phe Thr Ala Val He Phe Phe Tyr 

15 10 15 

Leu Asn Asn lie Phe Cys Leu Ala Gly Ser Lys Lys Tyr Gin He Arg 

20 25 30 

Glu Ala Leu Asp Cys Gly Asn Leu Ala Tyr Ser His Ser 
35 40 45 



■(2) INFORMATION FOR SEQ ID NO: 179: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 amino acids 
(3) TYPE: amino acid 

(C) STRANOEONESS: 

(D) TOPOLOGY : linear 

(.ii) MOLECULE TYPE: peptide 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID MO:179: 

lie Aso Leu His He Ser Asn Thr Met He Arg Lys Gln GIn Ser Ser 
1*5 10 15 

Val Leu Lys He Val Leu Cys Gly Leu Asn Phe 
2 0 2 5 



::: FORMAT IOM FOR SEQ ID MO: 130: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 3 amino acids 
(3) TYPE: amino acid. 

(C) STRANOEONESS: 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 180: 

Ser Gin Glu Met Glu Pro Lys Ser 
1 5 



(2) INFORMATION FOR SEQ ID MO: 181: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 1 amino acids 
(3) TYPE: amino acid 

(C) STRANOEONESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

Ser Lys Arg lie Ala Ser lie Leu Gin Lys Asa Arg Thr Leu Asn Leu 
15 10 15 

Ser Leu He Ser He Thr Asn Leu Asp Cys Leu 'Lys Arg Leu Thr Arg 
20 25 30 

Gin Leu His Arg Arg Ser Leu Lys Lys 
35 40 



(2) INFORMATION FOR SEQ ID NO: 13 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : . 
{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 132:. 

Ser Asn Phe Tyr 
1 



(2) INFORMATION FOR SEQ ID MO: 183: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 7 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
( 0 ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION-: SEQ ID NO: 183: 

Leu GIu Asn Lys Lau Asn Tyr 
I 5 



(2) INFORMATION FOR SEQ ID NO: 18 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 53 amino. acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 184: 

Tyr Arg Asp Gin Arg Ser Ser lie Asp Glu He Lys Ala Glu Leu Lys 
1 5 10 . 15 
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Leu Asp Asn Lys Lys Tyr Lys Pro Trp Ser Lys Tyr Cys Gly Arg Lys 
20 25 30 

Arg Arg Pro Val Ser Lys Arg Lys Asn Lys Ala lie Asn Lys Met Ser 
35 40 45 

Thr Glu Val Lys Lys 
50 



INFORMATION FOR SEQ ID NO: 18 5: 

ti) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 185: 

Lvs lie Tyr Phe Phe Gin 

f 5 



INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 13 amino acids 
(3) TYPE: amino acid 
<C) STRANDEDNESS : 
(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE: oeotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186: 

Phe lie Glu Lys Arg Gly Phe Gly Val Leu Gly Phe Trp 
1 5 10 



IN FORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1007 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) . TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 137; 

Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly lie His Ser Ala 
1 " 5 10 15 
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Leu Lys Thr Cys Glu Glu He Lys Glu Ala Lys Thr Leu Tyr Ser Trp 
20 25 30 

He Gin Lys Val He Arg Cys Arg Asn Gin Ser Gin Ser His Tyr Lys 
35 40 45 

Asp Leu Glu' Asp He Lys lie Phe Ala Gin Thr Asn lie Val Ala Thr 
50 55 60 ■ 

Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val He Ala Arg Lys Glu 
65 70 75 80 

Val Phe Ser Thr Gly Leu Met' lie Glu Leu lie Asp Lys Cys Leu Val 
35 90 95 

Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin Cys 
100 105 110 

Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu Leu 
115 120 125 

Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn 
'130 135 140 

Gin Val Arg Ala Met: He Gly Asn- Glu Leu Phe Arg His Leu Tyr Thr 
145 150' , 155 160 

Lys Tyr Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe 
165 170 175 

Cys.'Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp 
180 * 135 150 

Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys 
195 ' 200 205 

Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn 
210 ^ 215 220 

Asn lie Asn Val Pro Asn, Trp Asn Asn Met .Lys Ser Arg Thr Arg He 
225 230 W 235 240 

Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His 
245 " 250 255 

Glu Phe Val Ser Asn Lys Asn Asn He Ser Ala Met Asp Arg Ala' Gin 
260 265 270 

Thr Tie Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys Leu 
275 . 230 235 

Lys Aso Lys Val lie Glu Lys He Ala Tyr Met Leu Giu Lys Val Lys 
290 295 300 

Aso Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu 
305 310 315 320 

Asn Tro Arg Glu Arg Lys Gin Lys He Glu Asn Leu He Asn Lys Thr 
325 330 335 



120 



Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr 
340 345 350 

Asp Asn Lys Cys Val Thr Gin Phe He Asn Glu Phe Phe Tyr Asn He 
355 360 365 

Leu Pro Lys Asp Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys 
370 375 380 

Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His Lys 
335 390 395 400 

Asa Leu Leu Leu Glu Lys He Asn Thr Arg Glu He Ser Trp Met Gin 
405 410 415 

• Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn lie 
420 425 430 

Tyr Val Leu Trp Lys Leu Leu Arg Trp lie Phe Glu Asp Leu Val Val 
435 440 445 

Ser Leu He Arg Cys Phe Phe Tyr Vai Thr Glu Gin Gin Lys Ser Tyr 
4 50 455 ' 460 

Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn lie Trp Asp Vai lie Met Lys 
465 470 , 475 430 

Met Ser lie Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu 
435 490 495 

Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Aia Pro Gly Lys 
500 " . 505 510 

Leu Arg Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe 
515 \ 520 525 

Asn Lys Lys He Vai Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr 
530 535 * 54 0 

Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn 
545 550 ' 555 560 . 

Arg Met. Phe Lys Asp Pro Phe Gly Phe Ala Vai Phe Asn Tyr Asp Asp 
565 570 575 

Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly 
580 535 ' 590 

Gin Pro Lys Leu Phe Phe Ala Thr Met Asp He Glu Lys Cys Tyr Asp 
595 600 605 

Ser Val- Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu 
610 " 615 620 

Leu Ser Ser Asp Phe Tro He Met Thr Ala Gin He Leu Lys Arc Lys 
625 630 635 640 



Asn Asn He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu. Met Lys 
645 650 655 
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Asp Tyr Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin 
660 665 670 

Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu Asn 
675 680 685 

Ala Lys Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys 
690 695 700 

Lys Asp Asn Leu Leu Gin Pro Val lie Asn He Cys Gin Tyr Asn Tyr 
765 710 715 720 

lie Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin 

730 ■ 735 



725 



Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu 
740 745 750 

Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu 
755 760 755 

Asn P-o Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu 
770 775 780 

lie Thr Thr Gin Glu Asn Asn Ala -Val Leu Phe lie Glu Lys Leu lie 
■ 735 790" - 795 800 

Asn Val Ser Arg Glu Ash Gly Phe Lys Phe Asn Met Lys Lys Leu Gin 
805 810 813 

Thr -Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Me". Asp 



820 



325 830 



Ser Val Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp He Gly 



83: 



3 40 



rio S " Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn Leu 

350 355 .860 

Arg He Glu Gly He Leu Cys Thr Leu Asa. Leu Asn Met Gin Thr Lys 

865 870 875 

Lys A<a Ser Met Tro Leu Lys Lys Lys Leu Lys Ser Phe Leu Me-. Asn 

835 ■■ 890 893 

Asn lie Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe Ala 
900 905 9 10 . 

Asn Lys Thr Leu Asn Lys Leu Phe lie Ser Gly Gly Tyr Lys Tyr Met 
915 ■ 920 925 

Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met 
930 935 9"° 

Ser Ser Met lie Asp Leu Glu Val Ser Lys lie lie Tyr Ser Val Thr 

945 950 955 

arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr lie Phe 

' 965 970 97= 
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Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe 
980 985 990 

lie Glu lie Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 1000 I 005 



INFORMATION FOR SEQ ID MO: 188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) -MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp lie Gin Val Arg 

1 5-10- -15 

Val Lys lie Aso Asp Pro Lys Gin Tyr Leu Val Asn Val Thr Ala Ala 
20 25 30 

Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu Arg Arg 
35 40 45 

Tyr He lie Thr Lys Ala Leu 'Leu Glu Val Ala Glu Ser Asp Pro Glu 
50 55 60 

Ppo n. cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr lie Arg 
65 . . ™ .7 5 30 

T-, r vnr- Thr Asn T vr He Val Ala Phe Cys Val Val His Lys Asn Thr 
35 90 95 

Gin Pro Phe lie Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn 
100 105 HO 

Aso Leu Leu Glu Val Cys Glu Phe A.ia Gin Val Leu Tyr lie Phe Asp 
1.15 120 12 = 

Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser Gin Asp 
130 135 140 

II- Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg 
US 150 155 160 

Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr 

.165 no 175 

Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val Thr Asn 
180 185 190 

Lys Gin Lys Tro Asp Gin Thr Lys Lys Lys Arg Lys GLu Asn Leu Leu 
195 200 205 

Thr Lvs Leu Gin Ala lie Lys Glu Ser Glu Aso Lys Ser Lys Arg Glu 
210 215 220 
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Thr Gly Asp lie 
225 

Ala Val Met Lys 



Met Lys Ala Pro 
260 

Thr Phe Lys Asp 
275 

Arg Val Tyr Lys 
290 

Tyr Lys Ala Ala 
30 5 

Leu Ala Gly Lys 



Giu Leu Ser Ala 
340 

Ser Ser Asn Gin 
355 

lie Leu Lys Ala 
370 

Lys lie Cys Giu 
335 

Gin Phe Phe Ser 



?he Lys Ala Lys 
420 

Ala Val Lys Giu 
435 

Giu Leu Giu Gin 
450 

He Gly Lys Gin 
4 65 

Val Asn Lys Asn 



Asp Val Ser Gly 
500 

Tyr Gly Ser Val 
515 

Met Val Lys Gin 
53Q 



Met Asn Val Giu 
230 

Lys He Ala Lys 
245 

Lys He Pro Asn 



Leu He Lys Phe 
280 

He Leu Gly Lys 
295 

Phe Gly Asp Ser 
310 

Arg Met Lys lie 
325 

Lys Gly Asn Thr 



Leu Pro Tyr Met 
360 

Gly Val Ser Asp 
- 375 

Pro Lys Ala Val 
390 

Ala He Giu Ala 
405 

Lys Arg Giu Asn 



Val Val Giu Lys 
4 4 0 

Thr Giu Giu Gly 
'4 55 

Tyr He Asn Ser 
470 

Leu Asp Giu He 
485 

Ser Met Ser Thr 



Arg Thr Cys Leu 
520 

Arg Cys Giu Lys 
535 



Asp Ala He Lys 
235 

Arg Gin Asn Ala 
250 

Ser Thr Leu Giu 
265 

Cys His lie Ser 



Lys Tyr Pro Lys 

300 

Ala Ser Ala Pro 
315 

Giu He Ser Lys 
330 

Ala Giu Val Trp 
345 

Ala Met. Leu Arg 



Thr "Thr His Ser 
330 

Giu Asn Ser Lys 
395 

Vai Asn Giu Ala 
410 

-Met Asn Leu Lys 
425 

Thr Asp Giu Giu 



Giu Phe Val Lys 
460 

lie Giu Leu Ala 
475 

Lys Gly His Thr 
4 90 

Ser Met Ser Gly 
505 

Giu Cys Ala Leu 



Ser Ser Phe Tyr 
540 



Ala Leu Lys Pro 
240 

Met Lys Lys His 
255 

Ser Lys Tyr Leu 
270 

Giu Pro Lys Giu 
285 

Thr Giu Giu Giu 



Phe Asn Pro Giu 
320 

Thr Trp Giu Asn 
335 

Asp Asn Leu He 
350 

Asn Leu Ser Asn 
365 

lie Val He Asn 



Met Phe Pro Leu 
400 

7a 1 Thr Lys Gly 



Gly Gin He Giu 
4 30 ' 

Lys Lys Asp Met 
4 45 

Val Asn Giu Gly 



He Lys ' He Ala 
480 

Ala He Phe Ser 
495 

Gly Ala Lys Lys 
510 

Val Leu Gly Leu 
525 

He Phe Ser Ser 
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Pro Ser Ser Gin Cys Asn Lys 
545 550 

Asp Glu Leu Arg Pro Ser Met 
565 

Leu Gly Gly Gly Thr Asp Phe 
580 

Lys Asn Lys Thr His Val Asp 
595 

lie Ala Glu Gly Tyr Ser Asp 

610 ' 615 

Asn Ser lie Lys Lys Tyr Lys 

625 630 ■ 

Phe Ala Val Asp Leu Glu Gly 
645 

Glu Phe Asn Glu Asn Asn "Tyr 
660 

He Leu Lys Phe lie Ser Ala 
675 

Val lie Lys Asn Phe Ala Leu 
690 695 



IMTORMAT ION FOR SEQ ID t:0 : 13 9: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 397 amino acids 
(3) TYPE: amino acid 

(C) STRANDSDtMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 9: 

Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp Asn Gin Val 

I 5 10 15 

Arq Ala Met He Gly Asn Glu Leu Phe Arg His Leu Tyr Thr Lys Tyr 
20 25 30 

Leu He Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin Phe Cys Gly 
35 40 45 

Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe Asp Lys Lys 
50 55 ^ 60 

Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys Cys Ser Thr 
65 70 . 75 -80 

Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu Asn Asn He 

" 85 90 95 



Cys Tyr Leu Glu Val 
555 

Gin Lys Leu Leu Gin 
570 

Pro Tyr Glu Cys He 
585 

Asn He Val He Leu 

600 

He Asn Val Arg Gly 
620 

Asp Glu Val Asn Pro 
635 

Tyr Gly Lys Cys Leu 
650 

He Lys lie Phe Gly 
665" 

Lys Gin Gly Gly Ala 
680 

Gin Lys lie Gly 



Asp Leu Pro Gly 
560 

Glu Lys Gly Lys 
575 

Asp Glu Trp Thr 
590 

Ser Asp Met Met 
605 

Ser Ser He Val 



Asn He Lys He 
640 

Asn Leu Gly Asp 
655 

Met Ser Asp Ser 
670 

Asn Met Val Glu 
63 5 
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Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg lie. Phe Tyr 
100 liu 

Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys His Glu Phe 

115 I 20 . . 

Vai ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala Gin Thr He 
130 135 I 40 

Phe Thr Asn He Phe Arg Phe Asn Arg He Arg Lys Lys Leu Lys Asp 
145 150 155 

Lys Val lie Glu Lys lie Ala Tyr Met Leu Glu Lys val Lys Asp Phe 
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Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro Glu Asn Trp 



180 



185 



Arg Glu Arg Lys Gin Lys lie Glu Asn Leu lie Asn Lys Thr Arg Glu 

195 200 
Glu Lvs Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr Thr Asp Asn 

. 2L0- 215 220 

Lys Cvs Val Thr Gin Phe He Asn Glu- Phe Phe Tyr Asn He Leu Pro 
225 " 230 - 235 

Lys Aso Phe Leu Thr Gly Arg Asn Arg Lys Asn Phe Gin Lys Lys Val 

245 ' 250 

Lvs Lvs. Tyr Val Glu Leu Asn Lys His Glu Leu lie His Lys Asn Leu 

260 . 265 

L-u L-u Glu Lys lie Asn Thr Arg Glu He Ser Trp Met Gin Val Glu 
275 230 285 

^- -la Lvs His Phe Tyr Tyr Phe Asp His Glu Asn He Tyr Val 
11] " • 295 • 300 

L-u Tco Lys Leu Leu Arg Trp He Phe Glu Asp Leu Val Val Ser Leu 
305 " 310 315 

II- -a Cys Phe Phe Tyr Val Thr Glu Gin Gin Lvs Ser Tyr Ser Lys 
325 330 

Thr Tyr Tyr Tyr Arg Lys Asn He Trp Asp Val He Met Lys Met Ser 



340 



345 



He Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin Glu Lys Glu 
355 360 3o5 
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1 Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg 



370 



375 



Leu He Pro Lys Lys Thr Thr Phe Arg Pro He Met Thr Phe Asn Lys 



390 



Lys lie val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr 
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Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys Asn Arg Met 
420 425 430 

Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp Asp Val Met 
4-35 _ 440 4 45 

Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro 
450 455 460 

Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser Val 
465 470 475 480 

Asn Arg Glu' Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys Leu Leu Ser 
485 490 495 

Ser A.sp -Phe Trp lie Met Thr Ala Gin He Leu Lys Arg Lys Asn Asn 
500 - 505 510 

He Val He Asp Ser Lys Asn Phe Arg Lys Lys Glu Met Lys Asp Tyr 
515 520 525 

Phe Arg Gin Lys Phe Gin Lys He Ala Leu Glu Gly Gly Gin Tyr Pro 
530 535 540 

Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp. Leu Asn Ala Lys 
545 550 - 555 560 

Lys Thr Leu He Val Glu Ala Lys Gin Arg Asn Tyr Phe Lys Lys Asp 
565 570 575 

Asn Leu Leu Gin Pro Val lie Asn lie Cys Gin Tyr Asn Tyr He Asn 
530 585 590 

Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro Gin Gly Leu 
595 * 600 605 

Cvs val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu Glu 
610 615 " 620 

Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro Glu Asn Pro 
625 630 " ' 635 ' 640 

Asa Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr 
645 650 655 

Thr Gin Glu Asn Asn Ala Val Leu Phe lie Glu Lys Leu lie Asn Val 
660 665 670 

Ser Arg Glu' Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser 
675 680 695 

Phe Pro Leu Ser -Pro Ser Lys Phe Ala Lys Tyr Gly Met: Asp Ser Val 
690 695 700 

Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp lie Gly lie Ser 
705 710 715 720 

He Aso Met Lys Thr Leu Ala Leu Met Pro Asn lie Asn Leu Arg He - 
725 730 735 
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Glu Gly He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr Lys Ly^ Ala 
740 745 750 

Ser Met Tro Leu Lys Lys Lys Leu Lys Ser Phe Leu Met Asn Asn lie 
755 7 60 765 

Thr His Tyr Phe Arg Lys Thr lie Thr Thr Glu Asp Phe Ala Asn Lys 
770 775 780 

Thr L-u Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr Met Gin Cys 
785 790 795 800 

Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala Met Ser Ser 
805 810 815 ■ 

Met Tie Asp Leu Glu Val Ser Lys He He Tyr Ser Val Thr Arg Ala 
820 825 830 

Phe =he Lys Tyr Leu Val Cys Asn lie Lys Asp Thr lie Phe Gly Glu 
335 840 845 

Gi u w*s Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His Phe lie Glu 
850 ' 855 860 

rLe s-" Thr Lys Lys Tyr Lie ' Phe -Asn Arg Val Cys Met lie Leu 

865- " 870 875 380 

Lvs 2. a Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin Ser Leu He 
' "" 885 " 890 89= 

Gin 



INFORMATION FOR SEQ ID MO: 190: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH': 840 amino acids . 
(3) TYPE: amino acid 
(C) STRANDEONESS : 
'(D) TOPOLOGY: linear 

(ii) K01ECULE TYPE: peptide 



(;-.i) SEQUENCE DESCRIPTION: SEQ ID NO: 190: 

Met S-r Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro lie Gly Asp. Glu 
1 " 5 10 Lj 

Thr Asn Leu Asp Phe Val Leu Gin. Asn Leu Glu Val Tyr Lys Ser Gin 
20 . 25 30 

lie Giu His Tyr Lys Thr Gin Gin Gin Gin lie Lys Glu Glu As? Leu 
35 40 45 

Lvs L=u Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 6.0 

Aso Aso Aso Asp Glu Glu Asn Asn Ser Asn Lys Gin Glr. Glu Leu Leu 
65 ' ' 70 ^5 80 



128 



Arg Arg Val Asn Gin lie Lys Gin Gin Val Gin Leu lie Lys Lys Val 
85 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
- 100 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 

lie Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
145 150 155 160 

Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu lie Ser His Asp Gin 
165 170 175 

Lys Asn Tyr Val Ser lie Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
130 185 190 

Trp Leu Lys Asp Tyr Phe Asn- Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 " 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 - 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

He Asp Val Asn Phe Asp Asn Asn Leu Cys lie Leu Ala Leu Leu Arg 
245 ' 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn lie Leu Asn lie Arg Ser Ser 
. 2G0 265 270 

Tyr Thr Arg Asn Gin Tyr Asn Phe Glu Lys He Gly Glu Leu Leu Glu 
275 230 23 5 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
2 90 295 300" 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 "310 315 320 

Gin lie Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Asp Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 L 345 350 

Leu Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala He Pro 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asr. Val Leu Leu Lys Lys 
370 375 330 

Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 
335 390 395 400 
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Phe Aso Phe Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
405 410 "15 

Leu Glu Pro Asn lie Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu 
420 425 430 

Ser lie Lys Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

Thr Tyr Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
450 455 460 

Thr Thr lie Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro 
465 470 475 480 

Glu Thr Lys Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
485 490 495 

Ph* ^sp His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu He 
515 520 525 

^rq Ser Thr Asn Leu Lys Lys Phe Lys -Leu Ser Tyr Lys Tyr Glu Met 

' - 530 535' - 540 

Glu rvs Ser Lys Met Asp Thr Phe lie Asp Leu Lys Asn lie Tyr Giu 

5.45 550 - 555 560 

\sn Asn Leu Lys Arg Cys Ser Val Asn lie Ser Asn Pro His 



Th>- 



565 



570 57: 



Glv ~sn lie Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
530 535 590 

=>h- ■ vs Leu Thr Leu Asn Gin Glu Le-U Gin His Ala Lys Tyr Thr Phe 
595 600 605 - 

"ys r.ia Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
510 515 " 620 

S o r 3 =- Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 " ' 630 635 640 

II- -la S-r Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Pha Asn Lys Pro Asn Leu 
660 .665 670 

L^u 'he Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn' Val Ser 
6 75 630 

II* A 3 n Cvs He Leu Asp Gin His He Leu Asn Ser He Ser Giu Phe 
" 690 * 695 700 

Leu Glu Lvs Asn Lys Lys He Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 " 710 715 720 
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Leu Leu Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 730 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr He Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala Phe Tyr Glu Pro Leu Cys Glu Phe lie Lys Glu Ser Ser Gin 
770 775 780 

Thr Leu Gin Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser 
785 790 795 800 

He Lys Lys He Leu Glu Ser He Ser Glu Ser Lys Tyr His His Tyr 
805 810 815 

L-u Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu He Lys Ser Glu Asn 
820 825 830 

Glu Glu lie Gin Glu Leu Leu- Lys 
835 840 



INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 amino acids 
(3) TYPE: amino acid 

(C) STRAMDEDNESS : 

(D) TOPOLOGY: linear 

(Li) MOLECULE TYPE: peptide 

(:-.i) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

Asp lie Aso Leu Aso Asp He Glu Asn Leu Leu Pro Asn Thr Phe Asn 
1 5 ' 10 15 ■ 

Lys T V r Ser Ser Ser Cvs Ser Asp Lys Lys Gly Cys Lys Thr Leu Lys 
' 20 ■ " 25 3 ° 

Sor G iy Ser Lys Ser Pro Ser Leu Thr He Pro Lys "Leu Gin Lys Gin 
35 40 <5 

Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp Ser Phe Leu 



50 



55 60 



Arq Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu He Glu Thr 
65 ■ 70 75 30 

Leu Leu Met 
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INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE' DESCRIPTION: SEQ IDNO:192: 

Asn Val Lys Ser Ala Lys He Glu Ser Ser Ser Leu Glu Ser Leu Glu 
1 5 . 10 

Asp lie Asp Ser Lea Cys Lys Ser He Ala Ser Cys Lys Asn Leu Gin 



Asn Val Asn 
35 



20. 25 30 

He lie Ala Ser Leu Leu Tyr Pro Asn Asn lie Gin Lys 
40 45 

Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe Lys Gin Phe Glu Gin 
50 55 60'. 

L^u Lys Asn Leu Glu Asn Val Ser He Asn Cys He Leu Asp Glr. His 
65 70 



75 

Il« Leu Asn Ser lie Ser Glu Phe Leu Giu Lys Asn Lys Lys lie Lys 
85 90 95 

Ala Phe He Leu 
100 



INFORMATION TOR SEQ 10 NO: 13 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 5 amino acids. 

(B) TYPE: amino acid . ... 

(C) ST HANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide' 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l93: 

Met Glu Met Asp He Asp Leu Asp Asp He Giu Asn Leu Leu Pro Asn 
1 5 .10 

Ser Asp Lys Lys Gly Cys Lys 
30 

Leu Thr lie Pro Lys Leu 
40 45 

Gin Lvs Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn Asp 



Thr Phe Asn Lys Tyr Ser Ser Ser Cys 
20 25 



Thr Leu Lys Ser Gly Ser Lys Ser Pro Ser 
35 



50 55; 
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Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val Glu 
65 70 75 80 

lie Glu Thr Leu Leu 
85 



INFORMATION FOR SEQ ID HO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 99 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

He Glu Leu Ala He Lys He Ala Val Asn Lys Asn Leu Asp Glu He 

1 5 -. 10 15 

Lys Gly Kis Thr Ala He Phe Ser Asp Val Ser Gly Ser Met Ser Thr 

20 25* 30 

Ser Met Ser Gly Gly Ala Lys Lys Tyr Gly Ser Val Arg Thr Cys_ Leu 

35 40 45 

Glu Cys Ala Leu Val Leu Gly Leu Mat Val Lys Gin Arg Cys Glu Lys 

50 55 60 



Ser Ser Phe Tyr He Phe Ser Ser Pro Ser Ser Gin Cys Asn Lys Cys 
65 * 70 75 30 

Tyr Leu Glu Val Asp Leu Pro Gly Asp Glu Leu Arg Pro Ser Met Gin - 
8 5" 90 95 

Lys Leu Leu 



INFORMATION FOR SEQ ID NO: 195: 

(!) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE ; 

(A) NAME/KEY: Peptide 

( B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note- "consensus telornerase RT sequenc 

from motif A" 
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(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-s it e 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, 'lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE.: 

(A) NAME/KEY: Modi f ied-s it e 

(B) LOCATION : 4 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE : 

(A) NAME/KEY: Modif ied-s ite 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /product-* "OTHER" 

/note 35 "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Tr? 
or Met" 

(ix) FEATURE: , - 

(A)r NAME /KEY : Modif ied-s i te - 
(3) LOCATION: 10 

(D) OTHER INFORMATION: /product- "OTHER" 

/note 35 "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Tr? 
or Met" 

(i:<) FEATURE: 

(A) NAME /KEY : Mod i f i ed - s i te 
(3) LOCATION: 13 

(D) OTHER INFORMATION: /product- "OTHER " ~ 

■/note- "Xaa - hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Tr? - 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 195: 

Xaa Xaa Xaa Xaa Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1 5 10 



(2} INFORMATION FOR SEQ 10 NO: 196: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE; 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 13 

(D) OTHER INFORMATION: /note= ''consensus telomerase RT sequenc 

from motif B" 



(ix) .FEATURE : 

(A) NAME/KEY: Modif ied-s i te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product* "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196: 

Xaa Xaa Xaa Xaa Xaa Xaa Gin Gly Xaa Xaa Xaa Ser. Pro 
1 5 10 



INFORMATION FOR SEQ ID NO: 197:- 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 
(3) TYPE: amino acid 
(C} STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE : 

(A) NAME/ KEY: Peptide 
- (3) LOCATION: 1 /. 10 

{D) OTHER IN FORMATION 1 : /p,ote= "consensus telomerase RT sequenc 

from motif C" 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 
(3) LOCATION: 1 

( D) OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa - hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 
( 8 ) LOCATION: 5 

(D) OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-site 
(3) LOCATION: 8 . . 10 

(D) OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa = hydrophobic amino acid, 
Ala, Leu, Tie, Val, Pro, Phe, Trp 
or Me t " 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 197: 

Xaa Xaa Xaa Tyr Xaa Asp Asp Xaa Xaa Xaa 
1 5 10 

INFORMATION FOR SEQ ID NO: 198: 

(i) SEQUENCE CHARACTERISTICS: 
(A.) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 8 n ^ 

(D) OTHER INFORMATION: /note- "consensus telomerase RT sequence 

from motif D" 



(ix) FEATURE : 

(A) NAME /KEY : Modi f ied-s i t e 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER"- _ 

/note= "Xaa = hydrophobic amino acid, 
Ala,- Leu, He, Val, Pro, Phe, Trp 
or Met" 

(i:<) FEATURE: 

(A) NAME /KEY : Modi f led- s i te 
(3) LOCATION : 4 

(D) OT HER INFORMATION: /product- "OTHER" 

/note- "Xaa - hydrophobic ammo acia., 
Ala'^Leu, lie, Val/ Pro, Phe, Tro 
or Met" 



(xi) "SEQUENCE DESCRIPTION: SEQ 10 MO:193 

Gly Xaa Xaa Xaa Xaa Xaa Xaa Lys 
1 5 

INFORMATION FOR SEQ ID NO: 199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: 

( D) TOPOLOGY : linear 



(ii) MOLECULE TYPE : peptide 
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(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1..8 Qorm^nre 
(D) OTHER INFORMATION: /note- "consensus teiomerase Ri sequence 

from motif E" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-s ite 

( B ) LOCATION: 1 • ( 
(D) OTHER INFORMATION: /product- "OTHER 

cuj uititK. "/note- "Xaa - hydrophobic amino acid, 

Ala, Leu, He, Vai, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 

(B) LOCATION: 3 

(0) OTHER INFORMATION: /product= "OTHER 

Uinc ^ /note- "Xaa - hydrophobic amino acid, 

Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) MAME/KEY: Modi f ied-s i te 

(3) LOCATION : 6 "OTH^R" 

(0) OTHER INFORMATION : /prccuct= OTH-R _ 

tuJ /note= "Xaa - hydrophobic amino acid, 

Ala, Leu, He, VaL, Pro, Phe, Tro 

or Met" 

(ix) FEATURE: 

(A) MAME/KEY : Modi f ied-s i te 
(3) LOCATION: 8 

f D) OTHER INFORMATION: /product- "OTHER 

OLr '^ - /note= "Xaa - hydrophobic ammo acid, 

Ala, Leu, He, Val, Pro, Phe, Trp - 
or Met" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO:199: 

vaa Xaa Xaa Leu Gly Xaa Xaa Xaa 
I 5 

(2} INFORMATION FOR SEQ ID NO:2G0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION : 1 . . 34 . , 

(D) OTHER INFORMATION: /note- "motif A peptide from 
. Dong (LINE) " 



137 



Cxi) SEQUENCE DESCRIPTION : SEQ ID NO:200: 

Lys Asn Arg Asn Leu His Cys Thr Tyr He Asp Tyr Lys Lys Ala Phe 
1.5 10 15 

Asp Ser He Pro His Ser Trp Leu lie Gin Val Leu Glu lie Tyr Lys- 
20 25 " 30 

lie Asn 



INFORMATION FOR SEQ ID NO: 201: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

" (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3} LOCATION: 1 . , 35 

( D) OTHER INFORMATION: /note- '"motif B peptide from 

Dong (LINE)'* 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:201: 

Arg Gin He Ala He Lys Lys Gly lie Tyr Gin Gly Asp Ser Leu Ser 
15 10 15 

Pro Leu Trp Phe Cys Leu Ala Leu Asn Pro Leu Ser His Gin Leu His 
20 25 30 

Asn Aso Arg 

- . " 35" 



INFORMATION FOR SEQ ID MO: 202: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 3 amino acids 
(3) TYPE: amino acid 

(C) ST RAND EDM ESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . . 4 3 

(D) OTHER INFORMATION: /note- "motif C and D peptide from 

Dong (LINE) " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:202: 

His Leu lie Tyr Met Asp Asp lie Lys Leu Tyr Ala Lys Asn Asp Lys 
1.5 10 15 

Glu-Met Lys Lys Leu lie Asp Thr Thr Thr He Phe Ser Asn Asp He 
20 25 30 

Ser Met Gin Phe Gly Leu Asp Lys Cys Lys Thr 
35 40 

INFORMATION FOR SEQ TD NO: 203: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: .11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(i:<) FEATURE: 

(A) NAME/ KEY : Peptide 

(3) LOCATION: 1 . . 11 - . ^ _ 

(0) OTHER INFORMATION : /note- "motif E peptide from 

Dong ( LINE) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203: 
Lvs Cvs L<=>u Tyr Lys Tvr Leu Gly Phe Gin Gin 

L * 5 ; 10 

INFORMATION FOR SEQ ID NO: 204: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(3) LOCATION: 1 . . 34 . 

(D) OTHER INFORMATION: /note- "motif A peptide from 

■ Saccharomyces cerevisiae cytocnrome 

oxidase group II intron 1-encoded 

mitochondrial protein" 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:204: 

Phe Giv Gly Ser Asn T.rp Phe Arg Glu Val Asp Leu Lys Lys Cys Phe 
1 " J 5 10 15 
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Asp Thr He Ser His Asp Leu He lie Lys Glu Leu Lys Arg Tyr lie 
20 25 



Ser Aso 



INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) . MOLECULE TYPE: peptide 

(ix) "FEATURE : 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1..35 

0) OTHER INFORMATION: /note- "motif 3 peptide from 
[ ] Saccharomyces cerevisiae cytocnrome 

oxidase group II intron 1-encoded 
mitochondrial protein" 

Ui) SEQUENCE DESCRIPTION: SEQ 10 NO:205: 
His val Pro Val Gly Pro Arg Val Cys Val Gin Gly Ala Pro Thr Ser 



1 5 
Pro Ala 



L-u Cys Asn Ala Val Leu Leu Arg Leu Asp Arg Arg Leu Ala 

25 30 



20 



Gly Leu Ala 
3 5- 



IM FORMATION FOR SEQ 10 NO: 20 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

( ix) FEATURE : 

(A) NAME /KEY : Peptide 

(B) LOCATION; 1..25 

(D) OTHER INFORMATION: /note- "motir D peptide from 
1 ' Saccharomyces cerevisiae cytocnrome 

oxidase group II intron 1-encoded 
mitochondrial protein" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 

Lys lie lie Lys Arg Asp Leu Asn Asn Phe Leu Asn Ser Leu Gly Leu 
1 5 10 15 

Thr lie A5n Glu Glu Lys Thr Leu lie 
20 . 25 



INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
■ (3) LOCATION: 1^34 

(D) OTHER INFORMATION: /note- "motif A peptide from HIV RT " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 207 : 

Leu Lys Lys Lys Lys Ser Val Thr Val Leu Asp Val Gly Asp Ala Tyr 
1 5 10 15 

Phe Ser Val Pro Leu Asp Glu Asp Phe Arg Lys Tyr Thr Ala Phe Thr 
20. 23 30 

lie Pro 



INFORMATION FOR SEQ ID NO: 203: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 35 amino acids 
(3) TYPE: amino acid 
{C} STRAMDEDMESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION : 1 . . 35 

(0) OTHER INFORMATION: /note- "motif 3 peptide from HIV RT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:208: 

Gly lie Arg Tyr Gin Tyr Asn Val Leu Pro Gin Gly Tro Lys Gly Ser 
1 5. 10 15 

Pro Ala He Phe Gin Ser Ser Met Thr Lys lie Leu Glu Pro Phe Arg 
20 '25 30 
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Lys Gin Asn 
35 

(2) INFORMATION FOR SEQ ID MO: 209: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..17 f ^ m „ TV RT " 
(D) OTHER INFORMATION : /note- "motif C peptide from HIV RT 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:209:' 

He Tyr Gin Tyr Met Asp Asp Leu Tyr Val Gly Ser His Leu Glu He 
1 5 „ 

Gly 

(2) INFORMATION FOR SEQ ID MO: 2 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 
(3) TYPE: amino acid 
(C) STRAMDEDMESS : 
(0} TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



;ix) FEATURE: 

(A) NAME/KEY: Peptide 

Id! OTHER^N FORMAT ION : /note- "motif D and E peptide from 

HIV RT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:210: 
His Arg Thr Lys He Glu Glu Leu Arg Gin His Leu Leu Arg Trp Gly 
1 5 10 

Leu Thr Thr Pro Asp Lys Lys His Gin Lys Glu Pro Pro Phe Leu Trp 



20 

Met Gly Tyr Glu Leu 
35 



25 
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INFORMATION FOR SEQ ID MO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: . linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . . 38 

(D) OTHER INFORMATION: /note-* "motif C and D peptide from 

Saccharomyces cerevisiae 
' ESTp (L8543.12)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:211: 

lie Leu Lys Leu Ala Asp Asp Phe Lea lie lie Ser Thr A3p Gin Gin 

1 -5 10 15 

Glh Val lie Asn He Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr 
20 2S 30 

Asn Ala Lys Ala Asn. Arg 
35 



INFORMATION FOR SEQ ID NO: 212: 

' (ij SEQUENCE CHARACTER I ST ICS : 

(A) LENGTH: 11 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(0) TOPOLOGY: linear' 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION : 1 . . 11 

(D) OTHER INFORMATION: /note* "motif £ peptide from 

Saccharomyces cerevisiae 
ESTp (L8543,12)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

lie Arg Ser Lys Ser Ser Lys Gly lie Phe Arg 
1 5 10 
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(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

( ii ) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

Leu Gin Lys Gin Leu Glu Phe Tyr Phe Ser Asp Ala Asn Leu Tyr Asn 
1 ' 5 10 15 

Asp Ser Phe Leu Arg Lys Leu Val Leu Lys Ser Gly Glu Gin Arg Val 
20 25 30 

Glu He Glu Thr Leu Leu Met 
35 



(2) INFORMATION FOR SEQ ID NO:214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

{ B ) TYPE: amino acid 

(C) STRANDEDNESS : 

{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 214: 

lie Cys His Gin Xaa Glu Tyr Tyr Phe Gly Asp Phe Asn Leu Pro Arg 
I 5 ^ 10 15 

Asp Lys Phe Leu Lys Glu Gin He Lys Leu Asp Glu GLy Trp Val Pro 
20 * 25 30 

Leu Glu He Met He Lys 
35 



(2) INFORMATION EOR SEQ ID NO: 2 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

lie Cys Glu Gin He Glu Tvr Tyr Phe Gly Asp His Asn Leu Pro Arg 
1 5 " 10 15 
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Asp Lys Phe Leu Lys Gin Gin He Leu Leu Asp Asp Gly Trp Val Pro 
20 25 30 

Leu Glu Thr Met He Lys 
35 



(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

lie Leu Arg Gin Val Glu Tyr Tyr Phe Gly Asp Ala Asn Leu Asn Arg 
1 5 10 15 

Asp Lys Phe Leu 'Arg Glu Gin He Gly Lys Asn Glu Asp Gly Trp Val 
20 25 30 

Pro Leu Ser Val- Leu Val Thr 
35 



(2) INFORMATION FOR SEQ ID MO: 2 17: 

(!) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 38 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
MD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO : 217: 

Cys Leu Lys Gin Val Glu Phe Tyr Phe Ser Glu Phe Asn Phe Pro Tyr 
1 5 10 15 

Asp Arg Phe Leu Arg Thr Thr Ala Glu Lys Asn Asp Gly Trp Val Pro 
20 25 30 

lie Ser Thr lie Ala Thr 
35 



(2) INFORMATION FOR SEQ ID NO: 213: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



145 



(ii)' MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(3) LOCATION: 92. .2251 

(D) OTHER INFORMATION: /codoa- (seq: "taa", aa : Gin) 

/codon= (seq: "tag", aa: Gin) 
/codon- (seq: "tga", aa : TRM) 
/product^ "Tetrahymena therruophila 
80 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218.: 
AACTCATTTA ATTACTAATT TAATCAACAA GATTGATAAA AAGCAGTAAA TAAAACCCAA 60^ 

TAGATTTAAT TTAGAAAGTA TCAATTGAAA A ATG GAA ATT GAA AAC AAC TAA 112 

Met' Glu lie Glu Asn Asa Gin 
1 5 

GCA CAA _ TAG CCA AAA GCC. GAA AAA TTG TGG TGG GAA CTT GAA TTA GAG 160 
-la Gin Gin Pro Lys Ala Glu Lys Leu Trp Trp Glu Leu Glu Leu Glu 
10 15 20 

-V~G CAA GAA AAC CAA AAT GAT ATA TAA GT7~ AGG GTT AAG ATT GAC GAT 208' 
v e: Gin Glu Asn Gin Asn Asp He Gin Val' Arg Val Lys He Asp Asp 
25 30 35 

CCT AAG CAA TAT CTC GTG AAC GTC ACT GCA GCA TGT TTG TTG TAG GAA 256 
Lvs Gin Tyr Leu Val Asn Val Thr Ala Ala Cys Leu Leu Gin Glu 
40 * 45 50 55 



;GT ACT TAC TAC TAA GAT AAA GAT GAA AG A. AG A TAT ATC ATC ACT AAA 
;:.v Ser Tyr Tyr Gin Asp Lys Asp Glu Arg Arg Tyr He lie Thr Lys 
60 65 70 
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GCA CTT CTT GAG GTG GCT GAG TCT GAT CCT GAG TTC ATC TGG TAG TTG 3o2 
•~~a L-u Leu Glu Val Ala Glu Ser Asp Pro Glu Phe lie Cys Gin Leu~ 
75 80 .85 



r.-a G"C TA r ATC CGT AAT GAA CTT TAG ATC AG A ACT ACC ACT AAC TAC 4 00 

IS'a Val Tyr He Arg Asn Glu Leu Tyr lie Arg Thr Thr Thr Asn Tyr 
90 - 95 100 

ATT GTA GCA TTT TGT GTT GTC CAC AAG AAT ACT CAA CCA TTC ATC GAA 
He Val Ala Phe Cys Val Val His Lys Asn Thr Gin Pro Phe lie Glu 
105 HO 115 

AAG TAC TTC AAC AAA GCA GTA CTT TTG CCT AAT GAC TTA CTG GAA GTC 
Lys Tyr Phe Asn Lys Ala Val Leu Leu Pro Asn Asp Leu Leu Glu Val 
120 .125 130 135 

T-t G *A TT^ GCA TAG GTT CTC TAT ATT TTT GAT GCA ACT GAA TTC AAA 
Cvs Glu Phe Ala Gin Val Leu Tyr He Phe Asp Ala Thr Glu Phe Lys 
140 14 3 150 

A iT T t G TAT CTT GAT AGG ATA CTT TCA TAA GAT ATT CGT AAG GAA CTC 592 
i 3n l-u Tvr Leu Asp Arg He Leu Ser Gin Asp He Arg Lys Glu Leu 
155 160 165 



448 



496 



54 4 
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*CT TTC CGT AAG TGT TTA CAA AG A TGC GTC AGA AGC AAG TTT TCT GAA 
Thr Phe Arg Lys Cys Leu Gin Arg Cys Val Arg Ser Lys Phe Ser GLu 
170 175 

TTC AAC GAA- TAG TAA CTT GGT AAG TAT TGC ACT GAA TCC TAA CGT AAG 
»h. Asn Glu Tyr Gin Leu Gly Lys Tyr Cys Thr Glu Ser Gin Arg Lys 
185 19° 195 

AAA ACA ATG TTC CGT TAG CTC TCA GTT ACC AAC AAG TAA AAG TGG GAT 
Lvs Thr Met Phe Arg Tyr Leu Ser Val Thr Asn Lys Gin Lys Trp.Asp 
200 205 210 

— ACT AAG AAG AAG AGA AAA GAG AAT CTC TTA ACC AAA CTT TAG GCA 
C-V: ?hr L^s Lys Lys Arg Lys -Glu Asn Leu Leu Thr Lys Leu Gin Ala 

220 225 

— A*r c BA TCT GAA GAT AAG TCC AAG AGA GAA ACT GGA GAC ATA ATG 
Lys Git Ser Glu. Asp Lys Ser Lys Arg Glu Thr Gly Asp He Met 

235 240 245 

GTT GAA GAT GCA ATC AAG GCT TTA AAA CCA GCA GTT ATG AAG AAA 
Val Glu So Ala lie Lys Ala Leu Lys Pro Ala Va Met. Lys Lys 
250 , 255 260 

--- GCC ^G AGA TAG AAT GCC ATG AAG AAA CAC ATG AAG GCA CCT AAA 
-a Lvs Arg Gin Asn Ala Met Lys Ly* His Met Lys Ala Pro Lys 
265 ' . - 270 2 75 

— "CCT * aC TCT ACC TTG GAA TCA AAG TAG TTG ACC TTC AAG GAT CTC 
-■I fro Asn S« ?hr Leu Glu Ser Lys Tyr Leu Thr Phe Lys Asp Leu 

28 5 
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— -,r T T n TGC C*T ATT TCT GAG CCT AAA GAA AGA GTC TAT AAG ATC 
-"I Tys Phe Cys His He Ser Glu Pro Lys Glu Arg Val. Tyr Lys He 
300 j0r ~ 



qqt n» "AAA TAG CCT AAG ACC GAA. GAG GAA TAG AAA GCA GCC TTT 
Giy l7s Lvs Tyr Pro Lys Thr Glu Glu Glu Tyr Lys Ala Ala Pne 



315 



320 



_ TrT QCA TCT GCA CCC TTC AAT CCT GAA TTG GCT GGA AAG CGT 

I--;, sir S a ser Ala Pro Phe Asn Pro Glu Leu Ala Gly Lys Arg 

330 335 

,- G a,G ATT GAA ATC TCT AAA ACA TGG GAA AAT GAA CTC ACT GCA AAA 
vie Lys lie Glu U« Ser Lys Thr Trp Glu Asn Glu Leu Ser Ala Lys 
345 350 3 55 

rrr ,nr ACT GCT GAG GTT TGG GAT AAT TTA ATT TCA AGC AAT TAA CTC 
l^l £5 S All Glu Val Trp Asp Asn Leu lie Ser Ser Asn Gin eu 
350 365 

^ TAT ATG GCC ATG TTA CGT AAC TTG TCT AA.C ATC TTA AAA GCC GGT 
Tyr Met Ala Met Leu Arg Asn Leu Ser Asn He Leu Lys Ala Gly 
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TCA GAT ACT ACA CAC TCT ATT GTG ATC AAC AAG ATT TGT GAG CCC 
; /a [ ser Asp Thr Thr His Ser He Val lie Asn Lys He Cy» Glu 

395 400 



640 



688 



■ 736 



784 



832 



880 



928 



976 



1024 



1072 



1120 



1168 



1216 



1264 



1312 
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AAG GCC GTT GAG AAC TCC AAG ATG TTC CCT CTT CAA TTC TTT AGT GCC 13 60 

Lys Ala Val Glu Asn Ser Lys Met Phe Pro Leu Gin Phe Phe Ser Ala 
410 415 420 

ATT GAA GCT GTT AAT GAA GCA GTT ACT AAG GGA TTC AAG GCC AAG AAG 14 08 

lie Glu Ala Val Asn Glu Ala Val Thr Lys Gly Phe Lys Ala Lys Lys 
425 430 435 

AGA GAA AAT ATG AAT CTT AAA GGT CAA ATC GAA GCA GTA AAG GAA GTT 14 5 6 

Arg.Glu Asn Met Asn Leu Lys Gly Gin lie Glu Ala Val Lys Glu Val 
440 445 450 455 

GTT GAA AAA ACC GAT GAA GAG AAG AAA GAT ATG GAG TTG GAG TAA ACC 150 4 

Val Glu Lys Thr Asp Glu Glu Lys Lys Asp Met Glu Leu Glu Gin Thr 
-4 60 4 65 470 

GAA GAA GGA GAA TTT GTT AAA GTC AAC GAA GGA ATT GGC AAG CAA TAC 15 52 

Glu Glu Gly Glu Phe Val Lys Val Asn Glu Gly lie Gly Lys Gin Tyr 
475 480 485 

ATT AAC TCC ATT GAA CTT GCA ATC AAG ATA GCA GTT AAC AAG AAT TTA 1600 
lie Asn Ser lie Glu Leu Ala lie Lys lie Ala Val Asn Lys Asn Leu 
490 495 500 

GAT GAA ATC AAA GGA CAC ACT GCA ATC TTC TCT GAT GTT TCT GGT TCT 1648 
Asp Glu He Lys Gly His Thr Ala He Phe- Ser Asp Val Ser Gly Ser 
505 '510 - 515 

ATG AGT ACC TCA ATG TCA GGT GGA GCC AAG AAG TAT GGT TCC GTT CGT 1696 
Met Ser Thr Ser Met Ser Gly Gly Ala Lys Lys Tyr Gly Ser Val Arg 
320 525 530 535 

ACT TGT CTC GAG TGT GCA TTA GTC CTT GGT TTG ATG GTA AAA TAA CGT 17 4 4 

Thr Cys Leu Glu Cys Ala Leu Val Leu Gly Leu Met Val Lys Gin Arg 
540 * 545 550 

TGT GAA AA3 TCC TCA TTC TAC ATC TTC AGT TCA CCT AGT TCT CAA TGC 1792 
Cys Glu' Lys Ser Ser Phe Tyr lie Phe Ser Ser Pro Ser Ser Gin Cys 
555 560 565 

AAT AAG TGT TAC TTA GAA GTT GAT CTC CCT GGA GA.C GAA CTC " CGT CCT "13 4 0 

Asn Lys Cys Tyr Leu Glu Val Asp Leu Pro Gly Asp Glu Leu Arg Pro 
570 575 580 

TCT ATG TAA AAA CTT TTG CAA GAG AAA GGA AAA CTT GGT GGT GGT ACT 188 8 

Ser Met Gin Lys Leu Leu Gin Glu Lys Gly Lys Leu Gly Gly Gly Thr 
535 590 595 

GAT TTC CCC TAT GAG TGC ATT GAT GAA TGG ACA AAG AAT AAA ACT CAC 193 6 

Asp Phe Pro Tyr Glu Cys He Asp Glu Trp Thr Lys Asn Lys Thr His 
600 605 * 610 615 

GTA GAC AAT ATC GTT ATT TTG TCT GAT ATG ATG ATT GCA GAA GGA TAT 198 4 

Val Asp Asn lie Val lie Leu Ser Asp Met Met He Ala Glu Gly Tyr 
620 625 630 

TCA GAT ATC AAT GTT AGA GGC AGT TCC ATT GTT AAC AGC.ATC AAA AAG 2032 
Ser Asp He Asn Val Arg Gly Ser Ser He Val Asn Ser lie Lys Lys 
■635 64 0 .64 5 
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TAC AAG GAT GAA GTA AAT CCT AAC ATT AAA ATC TTT GCA GTT GAC TTA 208 0 

Tyr Lys Asp Glu Val Asn Pro Asa lie Lys lie Phe Ala Val Asp Leu 
650 655 660 

GAA GGT TAC GGA AAG TGC CTT AAT CTA GGT GAT GAG TTC " AAT GAA AAC 2123 
Glu Gly Tyr Gly Lys Cys Leu Asn Leu Gly Asp Glu Phe Asn Glu Asa 
665 670 675 

AAC TAC ATC AAG ATA TTC GGT ATG AGC GAT TCA ATC TTA AAG TTC ATT 2176 
Asn Tyr He Lys lie Phe Gly Met Ser Asp Ser He Leu Lys Phe He 
680 685 690 695 

TCA GCC AAG CAA GGA GGA GCA AAT ATG GTC GAA GTT ATC AAA AAC TTT 222 4 

Ser Ala Lys Gin Gly Gly Ala Asn Met Val Glu Val He Lys Asn Phe 
700 705 710 

GCC CTT CAA AAA ATA GGA CAA AAG TGAGTTTCTT GAGATTCTTC TATAACAAAA 2278 
Ala Leu Gin Lys He Gly Gin Lys 
715 

ATCTCACCCC ACTTTTTTGT TTTATTGCAT AGCCATTATG AAATTTAAAT TATTATCTAT _ 2 3 38 

TTATTTAAGT TACTTACATA GTTTATGTAT CGCAGTCTAT TAGCCTATTC AAATGATTCT 2 3 98 

GCAAAGAACA AAAAAG A T T A AAA 24 21 

(2) INFORMATION FOR SEQ ID NO:219:' 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 719 amino acids 
(3) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE:' protein 

(■/.!) SEQUENCE DESCRIPTION: SEQ ID.MO:219: 

Met Glu lie Glu Asn Asn Gin Ala Gin Gin Pro Lys Ala Glu Lys Leu 
1 5 10 15 

Trp Trp Glu Leu Glu Leu Glu Met Gin Glu Asn Gin Asn Asp lie Gin 
■ 20 25 30 

Val Arg Val Lys He Asp Asp Pro Lys Gin Tyr Leu Val Asn Val Thr 
35 40 45 

Ala Ala Cys Leu Leu Gin Glu Gly Ser Tyr Tyr Gin Asp Lys Asp Glu 
50 55 60 

Arg Arg Tyr He He Thr Lys Ala Leu Leu Glu Val Ala Glu Ser Asp 
65 70 ' 75 30 

Pro Glu Phe He Cys Gin Leu Ala Val Tyr lie Arg Asn Glu Leu Tyr 
85 90 95 

He Arg Thr Thr Thr Asn Tyr He Val Ala Phe Cys Val Val His Lys 
100 105 110 

Asn Thr Gin Pro Phe He Glu Lys Tyr Phe Asn Lys Ala Val Leu Leu 
115 120 125 
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Pr-o Asn Asp Leu Leu Glu Val Cys Glu Phe Ala Gin Val Leu Tyr He 
130 135 140 

°he Aso Ala Thr Glu Phe Lys Asn Leu Tyr Leu Asp Arg He Leu Ser 
" L 45 ' 150 155 160 

Gin Asp He Arg Lys Glu Leu Thr Phe Arg Lys Cys Leu Gin Arg Cys 
165 170 175 

Val Arq Ser Lys Phe Ser Glu Phe Asn Glu Tyr Gin Leu Gly Lys Tyr 
180 185 190 

Cys Thr Glu Ser Gin Arg Lys Lys Thr Met Phe Arg Tyr Leu Ser Val 
195 200 205 

Thr Asn Lys Gin Lys Trp Asp Gin Thr Lys Lys Lys Arg Lys Glu Asn 
210 215 220 

T-eu L»u Thr Lys Leu Gin Ala He Lys Glu Ser Glu Asp Lys Ser Lys 
225 " 230 235 240 

A-g Glu Thr Gly Asp He- Met Asn Val Glu Asp Ala lie Lys Ala Leu 
. 245 250 2S5 

Lys P-o Ala Val Met Lys Lys He Ala Lys Arg Gin Asn Ala Met Lys 
260 265 ' . 270 

rys His Met Lys Ala Pro Lys He Pro Asn Ser Thr Leu Glu Ser Lys 
275 280 285 

Tyr Leu Thr Phe Lys Asp Leu He Lys Phe Cys His He Ser Glu Pro 
290 295 300 

- < "s Glu P-c v*l Tyr Lys He Leu' Gly Lys Lys Tyr Pro Lys Thr Glu 
3Q5 " ' 310 315 320 

G'u G'u Tyr Lvs Ala Ala Phe Gly Asp Ser Ala Ser Ala Pro Phe Asn 
325 330 335 

o-o Giu L-u aia Gly Lys Arg Met -Lys He Glu He Ser Lys Thr Trp 
340 345 3d0 

Glu -Asn Glu Leu Ser Ala Lys Gly Asn Thr Ala Glu Val Trp Asp Asn 
355 ' 360 - 365 

Leu lie Ser Ser Asn Gin Leu Pro Tyr Met Ala Met Leu Arg Asn Leu 
370 375 380 

S-r asn He Leu Lys Ala Gly Val Ser Asp Thr Thr His Ser He Val 
385 " 390 395 400 

Tie' Asa Lys He Cys Glu Pro Lys Ala Val Glu Asn Ser Lys Met Phe 
405 410 415 

Pro Leu Gin Phe Phe Ser Ala He Glu Ala Val Asn Glu Ala Val Thr 
420 425 430 

Lvs Glv Phe Lys Ala Lys Lys Arg Glu Asn Met Asn Leu Lys Gly Gin 
■ 435 440 445 
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He Glu Ala Val Lys Glu Val Val Glu Lys Thr Asp Glu Glu Lys Lys 
450 455 460 

Asp Met Glu Leu Glu Gin Thr Glu Glu Gly Glu Phe Val Lys Val Asn 
465 ■ 470 4.75 " 480 

Glu Gly He Gly Lys Gin Tyr He Asn Ser He G1U Leu Ala He Lys 
485 490 495 

He Ala Val Asn Lys Asn Leu Asp Glu He Lys Gly His Thr Ala He 
500 505 510 

Phe Ser Asp Val Ser Gly Ser Met Ser Thr Ser Met Ser Gly Gly Ala 
515 520 525 

Lys Lys Tyr Gly Ser Val Arg Thr Cys Leu Glu Cys Ala Leu Val Leu 
530 535 540 

Glv Leu Met Val Lys Gin Arg Cys Glu Lys Ser Ser Phe Tyr He Phe 
545 550 555 560 

S-r Ser Pro Ser Ser Gin Cys Asn Lys Cys Tyr Leu Glu Val Asp Leu 
565 570 575 

Pro Glv Aso Glu Leu Arg Pro Ser Met Gin Lys Leu Leu Gin Glu Lys 
' 580 585 " 590 

Gly Lys Leu Gly Gly Gly Thr Asp Phe Pro Tyr Glu Cys He Asp Glu 
595 600 605 

Tro Thr Lys Asn Lys Thr His Val Asp Asn He Val He Leu Ser Asp 
610 615 620 

M-r M-t II* Ala Glu Gly Tyr Ser Asp He Asn Val Arg Gly Ser Ser 
525 " ■ 630 635 " 640 

11= Val Asn Ser He" Lys Lys Tyr Lys As? Glu Val Asn Pro Asn He 
645 650 655 

fys H° Phe Ala Val Asp Leu Glu Gly Tyr Gly Lys Cys Leu Asn Leu 
660 665 670 

Gly Asp Glu Phe Asn Glu Asn Asn Tyr He Lys He Phe Gly Met Ser. 
675 680 685 

Aso Ser He Leu Lys Phe He Ser Ala Lys Gin Gly Gly Ala Asn Met 
690 695 7 00 

Val Glu Val He Lys Asn Phe Ala Leu Gin Lys He Gly Gin Lys 
705 710 715 



(2) INFORMATION FOR SEQ ID NO: 220: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2329 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE: DMA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 47.-2665 ^ ^ 

(D) OTHER INFORMATION: /codon- (seq: taa , aa . mnj 



/codon- (seq 
/codon- (seq 



"tag", aa : Gin) 
^. "tga", aa: TRiM) 
/product- "Tetrahymena thermophila 
95 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:220: 
TCAATACTAT TAATTAATAA ATAAAAAAAA GCAAACTACA AAGAAA ATG TCA AGG 



1 



CGT AAC TAA AAA AAG CCA TAG GCT CCT ATA GGC AAT GAA AC A AAT CTT 
Arg aIS Gin Lys Lys Pro Gin Ala Pro lie Gly Asn Glu Thr Asn Leu 
5 10 15 

G ,T TTT GTA TTA CAA AAT CTA GAA GTT TAG AAA AGC CAG ATT GAG CAT 
:o Phe Val Leu Gin Asn Leu Glu Val Tyr Lys Ser Gin lie Glu His 
Yo 25 30 

•?-» t ^ G ACC TAG TAG TAA TAG ATC AAA' GAG GAG GAT CTC AAG CTT TTA 
£r Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu Lys Leu Leu 
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*AG TTC AAA AAT TAA GAT TAG GAT GGA AAC TCT GGC AAC GAT GAT GAT 
£s Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn Asp Asp „sp 
55 60 

r*-r GAA GA* A AC AAC TCA AAT AAA. TAA TAA GAA TTA TTA AGG AG A GTC 
Asp Git Glu A~sn Asn Ser Asn Lys Gin Gin Glu Leu Leu Arg Arg Val 

- -T TAG ATT AAG TAG CAA. GTT TAA TTG ATA AAA AAA GTT GGT TCT AAG 
'•Vi cm IL = Lys Gin Gin Val Gin Leu He Lys Lys Val G. y Se. Lys 
85" 9° 95 

r-TA GAG AAA GAT TTG AAT TTG AAC GAA GAT GAA AAC AAA AAG AAT GGA 
Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Ly a L ya A=n Cly 

loo - 105 110 

CTT TCT GAA TAG CAA GTG AAA GAA GA.G TAA TTA AGA ACG ATT ACT GAA 

r» u ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr U~ T ^ 
120 I 25 

raA TAG GTT AAG TAT TAA AAT TTA GTA TTT AAC ATG GAC TAC CA.G TTA 
Glu III vll Lys T^yr Gin Asn Leu Val Phe Asn Met Asp Tyr Gin Leu 
135 140 

se s: e s; hi % ss s: us e s e s - s - 

•• . L55 l&u 
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„ CT rflT ACT GAA AAA TGG TTT GAA ATA TCT CAT GAC CAA AAA AAT TAT 
if r A.fp rnr Glu Trp Phe Glu He Ser His Asp Gin Lys Asn Tyr 



55 



103 



151 



199 



247 



29 5 



343 



391 



439 



437 



535 



583 



165 ' 170 
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GTA TCA ATT TAG GCC AAC TAA AAG ACA TCA TAT TGT TGG TGG CTT AAA 6 31 

Val Ser lie Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp Trp Leu Lys 
180 185 190 195 

GAT TAT TTT AAT AAA AAC AAT TAT . GAT CAT CTT AAT GTA AGC ATT AAC 679 
Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val Ser lie Asn 
200 205 . 210 

AG A CTA GAA ACT GAA GCC GAA TTC TAT GCC TTT GAT GAT TTT TCA CAA 727 
Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp Phe Ser Gin 
215 220 225 

ACA ATC AAA CTT ACT AAT AAT TCT TAC TAG ACT GTT AAC ATA GAC ~GTT 77 5 

Thr lie Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn lie Asp Val 
2-30 235 240 

AAT TTT GAT AAT AAT CTC TGT ATA CTC GCA TTG CTT AGA TTT TTA TTA 82 3 

Asn Phe Asp Asn Asn Leu Cys lie Leu Ala Leu Leu Arg Phe Leu Leu 
245 250 255 

TCA CTA GAA AGA TTC AA.T ATT TTG AAT ATA AGA TCT TCT TAT ACA AGA ^ 871 
Ser Leu Glu Arg Phe Asn He Leu Asn He Arg Ser Ser Tyr Thr Arg 
260 265 270 275 

AAT TAA. TAT AAT TTT GAG " AAA ATT GGT GAG CTA CTT GAA ACT ATC TTC 919 
Asn "Gin Tyr Asn Phe Glu Lys lie Gly Glvl Leu Leu Glu Thr lie Phe 
280 - 285 290 

GCA GTT GTC TTT TCT CAT CGC CAC TTA CAA GGC ATT CAT TTA CAA GTT 967 
Ala Val Val Phe Ser His Arg His Leu Gin Gly He His Leu Gin Val 
295 300 305 

CCT TGC GAA. GCG TTC TAA TAT TTA GTT AAC. TCC TCA TCA TAA ATT AGC 1015 
Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser Gin lie Ser 
310 3L5 3-20 

GTT AAA GAT AGC TAA TTA TAG GTA TAC TCT TTC TCT ACA GAC TTA AAA 10 6 3 

Val Lys Asp Ser Gin. Leu Gin Val Tyr Ser Phe Ser Thr Asp Leu Lys 
325 ~ 330 335 

TTA GTT GAC ACT AAC AAA GTC CAA GAT TAT TTT AAG TTC TTA TAA GAA 1111 
Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe Leu Gin Glu 
340 345 3 50 3 55 ^ 

TTC CCT CGT TTG ACT CAT GTA AGC TAG TAG GCT ATC CCA . GTT AGT GCT 1159 
Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala lie Pro Val Ser Ala 
360 355 370 

ACT AAC GCT GTA GAG AAC CTC AAT GTT TTA CTT AAA AAG GTC AAG CAT 1207 
Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys Val Lys His 
375 380 385 

GCT AAT CTT AAT TTA GTT TCT ATC CCT ACC TAA TTC AA.T TTT GAT TTC 12 55 

Ala Asn Leu Asn Leu Val Ser lie Pro Thr Gin Phe Asn Phe Asp Phe 
390 395 400 

TAC TTT GTT AAT TTA TAA CAT TTG AAA TTA GAG TTT GGA TTA GAA CCA 1303 
Tyr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly Leu Giu Pro 
405 410 415 
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AAT ATT TTG'ACA AAA CAA AAG CTT GAA AAT CTA CTT TTG AGT ATA AAA 1351 
Asn He Leu Thr Lys Gin Lys Leu Glu Asn Leu Leu Leu Ser He Lys 
420 425 430 435 

TAA TCA AAA AAT CTT AAA TTT TTA AG A TTA AAC TTT TAC ACC TAG GTT 1399 
Gin Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr Thr Tyr Val 
440 445 450 

GCT TAA GAA ACC TCC AG A AAA CAG ATA TTA AAA CAA GCT ACA ACA ATC 14 47 

Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala Thr Thr He 
455 460 465 

AAA AAT CTC AAA AAC AAT AAA AAT CAA GAA GAA ACT CCT GAA ACT AAA 1495 
Lys Asn Leu Lys Asn Asn Lys Asn Gin Glu Glu Thr Pro _Glu Thr Lys 
470 - 475 480 

GAT GAA ACT CCA AGC GAA AGC ACA' AGT GGT ATG AAA TTT TTT GAT CAT 154 3 

Asp Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe Phe Asp His 
485 490 4 95 

CTT TCT GAA TTA ACC GAG CTT GAA GAT TTC AGC GTT AAC TTG TAA GCT 1591 
Leu Ser Glu Leu~ Thr Glu Leu Glu Asp Phe Ser Val Asn Leu Gin Ala 
500 505 510 515 

ACC CAA GAA ATT TAT GAT AGC TTG CAC AAA CTT TTG ATT .AGA TCA ACA 1639 
Thr Gin Glu tie Tyr Asp Ser Leu His Lys* Leu Leu He Arg Ser Thr 
520 * 525 - 530 

AAT TTA AAG AAG TTC AAA TTA AGT TAC AAA TAT GAA ATG GAA AAG AGT 168 7 

Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met Glu Lys Ser 
535 540 545 

AAA ATG GAT ACA TTC ATA GAT CTT AAG AAT ATT TAT GAA. ACC TTA AAC 17 35 

Lys Met Asp Thr Phe He Asp Leu Lys Asn lie Tyr Glu Thr Leu Asn 
550 * 555 560 

AAT CTT AAA AGA TGC TCT GTT AAT ATA TCA AAT CCT CAT GGA AAC ATT 17 33 

Asn Leu Lys Arg Cys Ser Val Asn He Ser Asn Pro His Gly Asn lie 
565 - 570 575 

TCT TAT GAA CTG ACA AAT AAA GAT TCT ACT TTT TAT AAA TTT AAG CTG 18 31 

Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys Phe Lys Leu 
530 . 535 " 590 595 

ACC TTA AAC TAA GAA TTA TAA CAC GCT AAG TAT, ACT TTT AAG TAG AAC 18 7 9 

Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe Lys Gin Asn 
600 605 610 

GAA TTT TAA TTT AAT AAC GTT AAA AGT GCA AAA ATT GAA TCT TCC TCA 192 7 

Glu Phe Gin Phe Asn Asn Val. Lys Ser Ala Lys lie Glu Ser Ser Ser 
615 620 625 

TTA GAA AGC TTA GAA GAT ATT GAT AGT CTT TGC AAA TCT ATT GCT TCT 197 5 

Leu Glu Ser Leu Glu Asp lie Asp Ser Leu Cys Lys Ser He Ala Ser 
630 ' 635 640 

TGT AAA AAT TTA CAA AAT GTT AAT ATT ATC GCC AGT TTG CTC TAT CCC 202 3 

Cys Lys Asn Leu Gin Asn Val Asn He He Ala Ser Leu Leu Tyr Pro 
645 650 655 
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AAC AAT ATT TAG AAA AAT CCT TTC AAT AAG CCC AAT CTT CTA TTT TTC 2071 
Asn Asn lie Gin Lys Asn Pro Phe Asn Lys Pro Asn Leu Leu Phe Phe 
660 665 670 675 

AAG CAA TTT GAA TAA TTG AAA AAT TTG GAA AAT GTA TCT ATC AAC TGT 2119 
Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser lie Asn Cys 
680 685. 690 

ATT CTT GAT CAG CAT ATA CTT AAT TCT ATT TCA GAA TTC TTA GAA AAG 2167 
lie Leu Asp Gin His lie Leu Asn Ser lie Ser Glu Phe Leu Glu Lys 
695 700 705 

AAT AAA AAA ATA AAA GCA TTC ATT TTG AAA AGA TAT TAT TTA TTA CAA 2 215 

Asn Lys Lys lie Lys Ala Phe lie Leu Lys Arg Tyr Tyr Leu Leu Gin 
710 . 715 720 

TAT TAT CTT GAT. TAT ACT AAA TTA TTT AAA AC A CTT CAA TAG TTA CCT 2263 
Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr. Leu Gin Gin Leu Pro 
725 ' 730 735 

GAA TTA AAT TAA GTT TAC ATT AAT TAG CAA TTA GAA GAA TTG ACT GTG 2311 
Glu Leu Asn Gin Val Tyr lie Asn Gin Gin Leu Glu Glu Leu Thr Val 
740 745 750 755 

ACT GAA GTA CAT AAG- TAA GTA TGG GAA AAC CAC AAG CAA AAA GCT TTC 235 9 

Ser Glu Val His Lys Gin Val Trp Glu Asa His Lys Gin Lys Ala Phe 
760 _ 7 65 770 

TAT GAA CCA TTA TGT GAG TTT ATC AAA GAA TCA TCC TAA ACC CTT TAG 24 0 7 

Tyr Glu Pro Leu Cys Glu Phe lie Lys Glu Ser Ser Gin Thr Leu Gin 
775 780 785' 

CTA ATA GAT TTT GAC CAA AAC ACT GTA ACT GAT GAC TCT ATT AAA AAG 24 55 

Leu lie Asp Phe Asp Gin Asn Thr Val Ser Asp Asp Ser lie Lys Lys 
790 795 300 _ 

ATT TTA GAA TCT ATA TCT GAG TCT AAG TAT CAT CAT TAT TTG AGA TTG 250 3 

lie Leu Glu Ser lie Ser Glu Ser Lys Tyr His His Tyr Leu Arg Leu 
805 8 10 ' 815. 

AAC CCT AGT TAA TCT AGC AGT'TTA ATT AAA TCT GAA AAC GAA GAA ATT 25 51 

Asp. Pro Ser Gin Ser Ser Ser Leu lie Lys Ser Glu Asn Glu Glu He 
820 825 830 835 

TAA GAA CTT CTC AAA GCT TGC GAC GAA AAA GGT GTT TTA GTA AAA GCA 2599 
Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu Val Lys Ala 
840 845 850 

TAC TAT AAA TTC CCT CTA TGT TTA CCA ACT GGT ACT TAT TAC GAT TAC 2 64 7 

Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr Tyr Asp Tyr 
855 860 865 

AAT TCA GAT AGA TGG TGATTAATTA AAT ATT .AG TT T AAA T AAA T A TTAAATATTG 2 7 02 
Asn Ser Asp Arg Trp 
870 

AATA.TTTCTT TGCTTATTAT TTG AAT AAT A CAT AC AAT AG TCATTTTTAG TGTTTTGAAT 2762 

ATATTTTAGT TATTTAATTC ATTATTTTAA' GTAAATAATT ATTTTTCAAT CATTTTTTAA 2322 

AAAATCG - 2 329 
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(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 872 amino acids ■ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:221: 

Met Ser Arg Arg Asn Gin Lys Lys Pro Gin Ala Pro He Gly Asn Glu 
1 - 5 10 15 

Thr Asn Leu Asp Phe Val Leu Gin Asn Leu Glu Val Tyr Lys" Ser Gin 
20 25 30 

lie Glu His Tyr Lys Thr Gin Gin Gin Gin He Lys Glu Glu Asp Leu 
35 40 45' 

Lys Leu Leu Lys Phe Lys Asn Gin Asp Gin Asp Gly Asn Ser Gly Asn 
50 55 60 

Asp Asp Asp Asp Glu Glu Asn Asn Ser Asn Lys Gin Gin Glu Leu Leu 
65 70 ■ 75 80 

Arg Arg Val Asn Gin He Lys Gin Gin Val Gin Leu lie Lys Lys Val 
85 t 90 95 

Gly Ser Lys Val Glu Lys Asp Leu Asn Leu Asn Glu Asp Glu Asn Lys 
100 105 110 

Lys Asn Gly Leu Ser Glu Gin Gin Val Lys Glu Glu Gin Leu Arg Thr 
115 120 125 

He Thr Glu Glu Gin Val Lys Tyr Gin Asn Leu Val Phe Asn Met Asp 
130 135 140 

Tyr Gin Leu -Asp Leu Asn Glu Ser Gly Gly His Arg Arg His Arg Arg 
1^5 150 155 160 

Glu Thr Asp Tyr Asp Thr Glu Lys Trp Phe Glu He Ser His Asp-Gin 
165 170 175 

Lys Asn Tyr Val Ser He Tyr Ala Asn Gin Lys Thr Ser Tyr Cys Trp 
180 185 190 

Trp Leu Lys Asp Tyr Phe Asn Lys Asn Asn Tyr Asp His Leu Asn Val 
195 200 205 

Ser He Asn Arg Leu Glu Thr Glu Ala Glu Phe Tyr Ala Phe Asp Asp 
210 215 220 

Phe Ser Gin Thr He Lys Leu Thr Asn Asn Ser Tyr Gin Thr Val Asn 
225 230 235 240 

Tie Asp Val Asn Phe Asp Asn Asn Leu Cys He Leu Ala Leu Leu Arg 
245 250 255 

Phe Leu Leu Ser Leu Glu Arg Phe Asn lie Leu Asn He A.rg Ser Ser 
260 265 . 270 
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T yr Thr Arg Asn Gin Tyr Asn Phe Glu Lys lie Gly Glu Leu Leu Glu 
275 280 285 

Thr He Phe Ala Val Val Phe Ser His Arg His Leu Gin Gly He His 
290 295 ,300 

Leu Gin Val Pro Cys Glu Ala Phe Gin Tyr Leu Val Asn Ser Ser Ser 
305 310 315 320 

Gin He Ser Val Lys Asp Ser Gin Leu Gin Val Tyr Ser Phe Ser Thr 
325 330 335 

Aso Leu Lys Leu Val Asp Thr Asn Lys Val Gin Asp Tyr Phe Lys Phe 
340 34S 350 

W Gin Glu Phe Pro Arg Leu Thr His Val Ser Gin Gin Ala He Pro 
355 360 365 

Val Ser Ala Thr Asn Ala Val Glu Asn Leu Asn Val Leu Leu Lys Lys 
370 375 380 

Val Lys His Ala Asn Leu Asn Leu Val Ser He Pro Thr Gin Phe Asn 

335 390 .395 

sh- ^so Phe Tvr Phe Val Asn Leu Gin His Leu Lys Leu Glu Phe Gly 
" ~ * ' 405 410- 415 

r= u Glu Pro Asn He Leu Thr Lys" Gin Lys Leu Glu Asn Leu Leu Leu 
420 125 430 

= He Lys G' n Ser Lys Asn Leu Lys Phe Leu Arg Leu Asn Phe Tyr 
435 440 445 

~— T y- Val Ala Gin Glu Thr Ser Arg Lys Gin He Leu Lys Gin Ala 
4S0 455 460 

t-> T h r He Lvs Asn Leu Lys" Asn Asn Lys Asn Gin Glu Glu Thr Pro 
•55 * " ' 470 475 480 

r.:-j T h- Lvs Aso Glu Thr Pro Ser Glu Ser Thr Ser Gly Met Lys Phe 
" 485 490 495 

^ As o His Leu Ser Glu Leu Thr Glu Leu Glu Asp Phe Ser Val Asn 
500 505 510 

Leu Gin Ala Thr Gin Glu He Tyr Asp Ser Leu His Lys Leu Leu He 
515 520 525 

Arg Ser Thr Asn Leu Lys Lys Phe Lys Leu Ser Tyr Lys Tyr Glu Met 
530 535 540 

Glu Lys Ser Lys Met Asp Thr Phe He Asp Leu Lys Asn He Tyr Glu 
545 550 555 

Thr Leu Asn Asn Leu Lys Arg Cys Ser Val Asn He Ser Asn Pro His 
565 570 575 

Gly Asn He Ser Tyr Glu Leu Thr Asn Lys Asp Ser Thr Phe Tyr Lys 
530 585 590 
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Phe Lys Leu Thr Leu Asn Gin Glu Leu Gin His Ala Lys Tyr Thr Phe 
595 600 605 

Lys Gin Asn Glu Phe Gin Phe Asn Asn Val Lys Ser Ala Lys He Glu 
HO 615 620 

Ser Ser Ser Leu Glu Ser Leu Glu Asp He Asp Ser Leu Cys Lys Ser 
625 630 635 640 

He Ala Ser Cys. Lys Asn Leu Gin Asn Val Asn He lie Ala Ser Leu 
645 650 655 

Leu Tyr Pro Asn Asn He Gin Lys Asn Pro Phe Asn Lys Pro Asa Leu 
660 665 670 

Leu Phe Phe Lys Gin Phe Glu Gin Leu Lys Asn Leu Glu Asn Val Ser 
675 680 685 

lie Asn Cys lie Leu Asp Gin His lie Leu Asn Ser He Ser Glu Phe 
690 * 695 700 

i>u Glu Lys Asn Lys Lys lie Lys Ala Phe He Leu Lys Arg Tyr Tyr 
705 710 " 715 720 

L^u L^u Gin Tyr Tyr Leu Asp Tyr Thr Lys Leu Phe Lys Thr Leu Gin 
725 " " 733 735 

Gin Leu Pro Glu Leu Asn Gin Val Tyr lie Asn Gin Gin Leu Glu Glu 
740 745 750 

Leu Thr- Val Ser Glu Val His Lys Gin Val Trp Glu Asn His Lys Gin 
755 760 765 

Lys Ala. Phe Tyr Glu Pro Leu Cys Glu Phe lie Lys Glu Ser Ser Gin 
770 775 ' ^30 

'**h~' Leu Gin Leu lie Aso Phe Aso Gin Asn Thr Val Ser Asp Asp 'Ser 
: V= ^ 790 * 795 300 

He Lys Lvs He Leu Glu Ser lie Ser Glu Ser Lys Tyr His His Tyr 
305 310 " 315 

Leu Arg Leu Asn Pro Ser Gin Ser Ser Ser Leu lie Lys Ser Glu Asn 
320 825 330 

GLu Glu lie Gin Glu Leu Leu Lys Ala Cys Asp Glu Lys Gly Val Leu 
835 840 84 5 

Val Lys Ala Tyr Tyr Lys Phe Pro Leu Cys Leu Pro Thr Gly Thr Tyr 
850 8 55 860 

Tyr Asp Tyr Asn Ser Asp Arg Trp 
365 870 
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(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 

(B) LOCATION: 1 . . 88 4 

(D) OTHER INFORMATION: /note= " Saccharocuyces cerevisia* 

ESTp (L8543. 12) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

Met Lys lie Leu Phe Glu Phe lie Gin Asp Lys Leu Asp lie- Asp Leu 
L 5 10 15 

Gin Thr Asn Ser Thr Tyr Lys Glu Asn Leu Lys Cys Gly His Phe Asn 
20 - - 25 30 

Gly Leu Asp Glu lie Leu Thr Thr Cys Phe Ala Leu Pro Asn Ser Arg 
3 5' - 4 0 4 5 

Lys lie Ala Leu Pro Cys Leu Pro Gly Asp Leu Ser His Lys Ala Val 
50 55 50 

lie Asp His Cys lie lie Tyr Leu Leu Thr Gly Glu Leu Tyr Asn Asn 
63 ^ 70 75' 80 

Val Leu Thr Phe Gly Tyr Lys lie Ala Arg Asn Glu Asp Val Asn Asn 
35 - 90 95 

Ser Leu Phe Cys His Ser Ala Asn Val Asn Val Thr Leu Leu Lys Gly 
100 105 110 

Ala Ala Trp Lys Met Phe His Ser Leu Val Gly Thr Tyr Ala Phe Val 
115 120 . 125 

Asd Leu Leu lie Asn Tyr Thr Val lie Gin Phe Asn Gly Gin Phe Phe 
130 135 140 

Thr Gin lie Val Gly Asn Arg Cys Asn Glu Pro His Leu Pro Pro Lys 
145 150 155 160 

Trp Val Gin Arg Ser Ser Ser Ser Ser Ala Thr Ala Ala Gin lie Lys 
165 170 175 

Gin Leu Thr Glu Pro Val Thr Asn Lys Gin Phe Leu His Lys Leu Asn 
180 185 190 

lie Asn Ser Ser Ser Phe Phe Pro Tyr Ser Lys lie Leu Pro Ser Ser 
195 200 . 205 



Ser Ser lie Lys Lys Leu Thr Asp Leu Arg Glu Ala lie Phe Pro Thr 
210 215 220 
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Asn Leu Val Lys lie Pro Gin Arg Leu Lys Val Arg lie Asn Leu Thr 
225 230 235 240 

Leu Gin Lys Leu Leu Lys Arg His Lys A.rg Leu Asn Tyr Val Ser lie 
245 250 255 

Leu Asn Ser lie Cys Pro Pro Leu Glu Gly Thr Val Leu Asp Leu Ser 
260 265 270 

His Leu Ser Arg Gin Ser Pro Lys Glu Arg Val Leu Lys Phe lie He 
275 ' 280 285 

Val "He Leu Gin Lys Leu Leu Pro Gin Glu Met Phe Gly Ser Lys Lys 
290 295 300 

Asn Lys Gly Lys He He Lys Asn Leu Asn Leu Leu Leu Ser Leu Pro 
305 '310 315 320 

Leu Asn Gly Tyr Leu Pro Phe Asp Ser Leu Leu Lys Lys Leu Arg Leu 
325 330 335 

Lys Asp Phe Arg Trp Leu Phe He Ser Asp lie Trp Phe Thr Lys His 
340 '345 350 

Asn Phe Glu Asn Leu Asn Gin Leu Ala He -Cys Phe He Ser Trp Leu 
355 360 - 365 

Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys Thr 
370 375 380 

Glu He Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr Trp 

335 390 395 400 

Asn L-ys Leu He Thr Pro Phe lie Val Glu Tyr Phe Lys Thr Tyr Leu 
405 4 10 4 15 

Val Glu Asn Asn Val Cys Arc Ash His Asn Ser Tyr Thr Leu Ser Asn 
420 425 '130 

Phe Asn His Ser Lys Met: Arg lie He Pro Lys Lys Ser Asn Asn Glu 
4 35 4 40 4 4 5 

Ph e t rg n e xie Ala lie Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
450 " 455 460 

Thr He Tyr Lys Glu Asn His Lys Asn Ala lie Gin Pro Thr Gin Lys 
465 470 475 480 

He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys He 
485 490 495 

Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin Arg 
500 505 510 

Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met Lys 
515 520 525 



Phe Asp Val Lys Ser Cys Tyr Asp Ser He Pro Arg Met Glu Cys Met 
530 " ^ 535 540 
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Arg lie Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val Arg 
545 550 555 560 

Ser Gin Tyr Phe Phe Asn Thr Asn Thr Gly Val Leu Lys Leu Phe Asn 
565 570 ' 575 . 

Val Val Asn Ala Ser Arg Val Pro Lys Pro Tyr Glu Leu Tyr lie Asp 
580 585 590 

Asn Val Arg Thr Val His Leu Ser Asn Gin Asp Val He Asn Val Val 
595 600 605 

Glu Met Glu He Phe Lys' Thr Ala Leu Trp Val Glu Asp Lys Cys Tyr- 
610 615 620 

lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro He 
625 630 635 640 

Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys 
645 650 655 

Ala Ser Pro Ser Gin Asp Thr Leu Xle Leu Lys Leu Ala Asp Asp Phe 
660 665 670 

Leu He He Ser Thr Asp Gin- Gin Gin Val He Asn He Lys Lys Leu 
"675 " 630 - 63 5 

Ala Me c Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys 
690 - 695 700 



,eu Ala Val Ser Ser Gin Ser Asp Asp Asp Thr Val He Gin Phe 
705 710 * 715 720 

Cys Ala Me" His He Phe Val Lys Glu Leu Glu Val Trp Lys His Ser 

725 ^ 730 _ 735 

Ser Thr Met Asn Asn Phe His He Arg Ser Lys Ser Ser Lys Gly He 
740 745 ' 750 

Phe Arg Ser Leu He Ala Leu Phe Asn Thr Arg lie Ser Tyr Lys Thr 
755 760 765 

lie Asp Thr Asn Leu Asn Ser Thr Asn Thr Val Leu Met Gin He Asp 
770 775 730 

His Val Val Lys Asn He Ser Glu Cys Tyr Lys Ser Ala Phe Lys Asp' 
735 790 795 800 

Leu Ser He Asn Val Thr Gin Asn Met Gin Phe His Ser Phe Leu Gin 

805 810 815 

A.rg He He Glu Met Thr Val Ser Gly Cys Pro He Thr Lys Cys Asp 
820 825 830 

Pro Leu He Glu Tyr Glu Val Arg Phe Thr He Leu Asn Gly Phe Leu 
835 340 845 



Glu Ser Leu Ser .Ser Asn Thr Ser Lys Phe Lys Asp Asn He He Leu 
850 855 860 
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Leu Arg Lys Glu lie Gin His Leu Gin Ala Tyr lie Tyr lie Tyr He 
365 870 875 330 

His lie Vai Asn 



INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:223: 

Leu Cys Val Ser Tyr He Leu Ser Sex Phe Tyr Tyr Ala Asn Leu Glu 
15 10 15 

■ Glu Asn Ala Leu Gin Phe Leu Arg Lys Glu Ser Met Asp Pro Glu Lys 
20 25 30 

Pro Glu Thr Asn Leu Leu Met Arg Leu Thr 
35 40 



INFORMATION FOR SEQ ID NO:224: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 
(3) .TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 224: 

L^u Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr Leu Glu 
1 "5 10 15 

Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asa Pro Glu Asn 
20 25 30 

Pro Asn' Val Asn Leu Leu Met Arg Leu Thr 
35 40 



INFORMATION FOR SEQ ID MO: 225: 

(l) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .2631 

CD) OTHER INFORMATION: /note^ "Saccharomyces cerevisiae 

EST2 " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:225: 

ATTTATACTC ATGAAAATCT TATTCGAGTT CATTCAAGAC AAGCTTGACA TTGATCTACA 60 

GACCAACAGT ACTTACAAAG AAAATTTAAA ATGTGGTCAC TTCAATGGCC TCGATGAAAT 12 0 

TCTAACTACG TGTTTCGCAC TACCAAATTC AAG AAAAAT A GCATTACCAT GCCTTCCTGG 13 0 
TGACTTAAGC CACAAAGCAG TCATTGATCA CTGCATCATT TACCTGTTGA CGGGCGAATT . 24 0 

ATACAACAAC GTACTAACAT TTGGCTATAA AATAGCTAGA AATGAAGATG TCAACAATAG 30 0 

TCTTTTTTGC CATTCTGCAA ATGTTAACGT TACGTTACTG ' AAAGGCGCTG CTTGGAAAAT 360 

GTTCCACAGT TTGGTCGGTA CATACGCATT CGTTGATTTA TTGATCAATT ATACAGTAAT 4 20 

TCAATTTAAT GGGCAGTTTT TCACTCAAAT CGTGGGTAAC AGATGTAACG AACCTCATCT 48 0 

GCCGCCCAAA TGGGTCCAAC GATCATCCTC ATCATGCGCA ACTGCTGCGC AAATCAAACA 54 0 

ACTTACAGAA CCA.GTGA.CAA ATAAA.CAA.TT CTTACACAAG CTCAATATAA ATTCCTCTTC 6O0 

TTTTTTTCCT TAT AG C AAG A TCCTTCCTTC ATCATCATCT A T C AAA AAG C TAACTGACTT 660 

GAGAGAAGCT ATTTTTCCCA CAAATTTGGT TAAAA.TTCCT CAGAGACTAA. AGGTACGAAT 7 20 

7AATTTGACG CTGCAAAAGC TATTAAAGAG ACATAAGCGT TTGAATTACG TTTCTATTTT 7 80 

GAA7AGTATT 7G7CCACCAT TGGAAGGGAC CGTATTGGAC TTGTCGCATT TGAGTAGGCA -8 4 0 

A77 ACCAAAG GAACGAG7CT TGAAA7TTAT CATTGTTATT TTACAGAA.GT T ATT AC C CCA 900 

AGAAATGTTT GGCTCAAAGA AAAATAAA.GG AAAAAT TATC AAGAA.TCTAA ATCTTTTATT' 9 60 

AAG TT T AC CC TTAAATGGCT ATTTACCATT TGATAGTTTG T T G AAAAA G T T AAG ATT AAA 10 20 

GGATTTTCGG TGGTTGTTCA TTTCTGATAT TTGGTTCACC AAGCACAATT TTGAAAACTT 1030 

GAATCAATTG GCGATTTGTT TCATTTCCTG GCTATTTAGA CAA.CTAATTC CCAAAATTAT 1140 

ACAGACTTTT TTTTACTGCA CCGAAATATC TTCTACAGTG ACAATTGTTT ACTTTAGACA 12 00 

TGATACTTGG AATAAACTTA TCACCCCTTT TATCG'TAGAA TATTTTAAGA CGTACTTAGT 1260 

C G AAAvAC AAC GTATGTAGAA ACCATAATA.G TTACACGTTG TCCAATTTCA ATCATAGCAA 1320 

AATGAGGATT A T AC C AAAAA AAAGTAATAA TGAGTTCAGG ATTATTGCCA TCCCATGCAG 1380 

AGGGGC AGAC GAAGAAGAAT TCACAATTT A TAA.GGAGAAT CACAAAAATG CTATCCAGCC 14 40 

CACTC AAAAA ATTTTAGAAT AC CT AAG AAA CAAAAGGCCG ACTAGTTTTA CTAAAATATA 1500 

TTCTCCAACG CAAATAGCTG ACCGTATCAA AGAATTTAAG CAGAGACTTT TAAAGAAATT 15 60 
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TAATAATGTC TTACCAGAGC TTTATTTCAT GAAATTTGAT GTCAAATCTT GCTATGATTC 162 0 

CATACCAAGG ATGGAATGTA TGAGGATACT CAAGGATGCG CTAAAAAATG AAAATGGGTT 168 0 

TTTCGTTAGA- TCTCAATATT TCTTCAATAC CAATACAGGT GTATTGAAGT TATTTAATGT 17 4 0 

TGTTAACGCT AGCAGAGTAC CAAAACCTTA TGAGCTATAC ATAGATAATG TGAGGACGGT 18 00 

TCATTTA.TCA AATCAGGATG TTATAAACGT. TGTAGAGATG GAAATATTTA AAACAGCTTT 18 60 

GTGGGTTGAA GATAAGTGCT ACATTAGAGA AGATGGTCTT TTTCAGGGCT CTAGTTTATC 192 0 

TGCTCCGATC GTTGATTTGG TGTATGACGA TCTTCTGGAG TTTTATAGCG AGTTTAAAGC 193 0 

CAGTCCTAGC CAGGACACAT TAATTTTAAA ACTGGCTGAC GATTTCCTTA TAATATCAAC 20 4 0 

AGACCAACAG CAAGTGATCA ATATCAAAAA GCTTGCCATG GGCGGATTTC AAAAATATAA 2100 

TGCGAAAGCC AATAGAGACA AAATTTTAGC CGTAAGCTCC * CAATCAGATG ATGATACGGT 2160 

TATTCAATTT TGTGCAATGC ACATATTTGT TAAAGAATTG GAAGTTTGGA AACATTCAAG 22 20 

CACAATGAAT AATTTCCATA TCCGTTCGAA ATCTAGTAAA GGGATATTTC GAAGTTTAAT 2280 

AGCGCTGTTT AACACTAGAA TCTCTTATAA AACAATTGAC ACAAATTTAA ATTCAACAAA 2340 

CACCGTTCTC A7GCAAATTG A7CATGTTGT AAAGAACATT TCGGAATGTT ATAAATCTGC 2400 

TTTTAAGGAT CTATCAATTA ATGTTACGCA AAATATGCAA TTTCATTCGT TCTTACAACG 2 4 60 

CATCATTGAA A7GACAGTCA GCGGTTGTCC AATTACGAAA TGTGATCCTT TAATCGAGTA 2520 

TGAGGTACGA 77CACCATAT TGAATGGATT TTTGGAAAGC CT ATCTTCAA AC AC AT C AAA. 2 530 

AT77AAAGAT AA7A7CA7TC TT7TGAGA.AA GG AAA TTCAA CACTTGCAAC C 2 631 

iZ; £MrOP.i':A7:OCf TOR 3EQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 amino acids 
:3) TYPE: amino acid 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:226: 

Phe Phe Tyr Cys Thr Glu He Ser Ser Thr Val The lie Val Tyr ?he 

1 - " 5 10 15 

Arg His Asp Thr Trp Asa Lys Leu He Thr Pro Phe lie Val Glu Tyr 

20 25 30 

Phe Lvs Thr Tvr Leu Val Glu Asn Asn Val Cys Arg Asn His Asn Ser 

35 * 40 45 

Tyr Thr Leu Ser Asn Phe Asn His Ser Lys Met Arg He He Pro Lys 
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Lys Ser Asn Asn Glu Phe Arg lie 
65 70 

Asp Glu Glu Glu Phe Thr lie Tyr 
85 

Gin Pro Thr Gin Lys He Leu Glu 
100 

Ser Phe Thr Lys lie Tyr Ser Pro 
115 120 

Glu Phe Lys Gin Arg Leu Leu Lys 
130 135 

Leu Tyr Phe Met Lys Phe Asp Val 
145 150 



He Ala He Pro Cys Arg Gly Ala 
75 30 

Lys Glu Asn His Lys Asn Ala He 
90 " 95 

Tyr Leu Arg Asn Lys Arg Pro. Thr 
105 110 

Thr Gin He Ala Asp Arg He Lys 
125 

Lys Phe Asn Asn Val Leu Pro Glu 

140 . 

Lys Ser Cys Tyr Asp 
155 



INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 amino acids 
(3) TYPE: amino acid 

(C) STRAMDEDNESS: 

(D) TOPOLOGY: linear' 

(ii) MOLECULE TYPE: peptide 



(:<i) SEQUENCE DESCRIPTION : SEQ ID NO: 227; 

Phe Phe Tyr Vai Thr Giu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr 
1 5 10 15 

Tyr Arg Lys Asn lie Trp Asp Vai He Met Lys Met Ser lie Ala Asp 
20 -25 30 

Leu Lys Lys Glu Thr Leu Ala -Glu Val Gin Giu Lys Giu Val Giu Glu 
3 5 4 0 4 5 

Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro 
"50 55 60 

Lys Lys Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys lie Val 
65 - 70 75 '30 

Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr Thr Asn Thr Lys Leu Leu 
35 90 95 

Asa Ser His Leu Met Lea Lys Thr Leu Lys Asn Arg Met Phe Lys Asp 
100 105 110 

Pro Phe Glv Phe Ala Val Phe Asn Tyr Asp Asp Val Met Lys Lys Tyr 
115 120 125 

Giu Glu Phe Val Cys Lys Trp Lys Gin Val Gly Gin Pro Lys Leu Phe 
130 ' 135 140 



Phe Ala Thr Met Asp. lie Giu Lys Cys Tyr Asp 
14 5 150 155 
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(2) IN FORMAT ION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 158 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:228: 

Lys His Lys Glu Gly Ser Gin lie Phe Tyr Tyr Arg Lys Pro lie Trp 
1 5 . 10 15 

• Lys Leu Val Ser Lys Leu Thr- lie Val Lys Val Arg He Gin Phe Ser 
20 25 30 

Glu Lys Asa Lys Gin Met Lys Asa Asn Phe Tyr Gin Lys lie Gin Leu 
35 40 45 

Glu Glu Glu Asn Leu Glu Lys Val Glu Glu Lys Leu lie Pro Glu Asp 
50 55 60 



Ser Phe Gin Lys Tyr Pro Gin Gly Lys Leu Arg lie lie Pro Lys Lys 
65 7 ° 75 

Gly Ser Phe Arg Pro He Met Thr Phe Leu Arg Lys As? Lys Gin Lys 
35 90 ^ 

^sn Ii- Lvs Leu Asn Leu Asn Gin lie Leu Met Asp Ser Gin Leu Val 
' 100 105 110 

□ h „ r^ Q Asn Leu Lvs Aso Met Leu Gly Gin Lys He Gly Tyr Ser Val 
J " 115 120" 

.50 Asn Lvs Gin He Ser Glu Lys Phe Ala Gin Phe He Glu Lys 
^ ^ 130 * - 135 140 



Trp Lys Asn Lys Gly Arg Pro Gin Leu Tyr Tyr Val Thr Leu 
145 



150 155 



(2) INFORMATION FOR SEQ ID MO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) ST RAN D E 0 M E S S : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ 10 MO: 229: 

Tyr Tvr Tyr Arg Lys 
1 5 



166 



{2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 230 

Gly Lys Leu Arg He He Pro Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 2 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3). TYPE: amino acid 
(C) STRANDEDNESS : 
■(D) TOPOLOGY: linear 

(ii) 'MOLECULE TYPE': peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:231 

Phe Arc P-ro lie Met Thr ?he 
1 ' 5 



INFORMATION ' FOR SEQ ID MO:232: 

(i) .SEQUENCE CHARACTERISTICS: " 
(A) LENGTH: 2 4 amino acids 
(2) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(;<i) SEQUENCE DESCRIPTION: SEQ ID NO:232: 

Phe Pn* Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe 
1 5 10 15 

Tyr Arg Lys Ser Val Trp Ser Lys 
20 



(2) INFORMATION FOR SEQ ID NO: 23 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 amino acids 
'(B) TYPE: amino acid 

(C) . STRANDEDNESS : ■ 

(D) TOPOLOGY: linear 



167 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser Glu Ala Glu 
1 5 10 15 

Val Arg Gin His Arg Glu Ala 
20 



(2) INFORMATION FOR SEQ ID NO:234: • 

(i) SEQUENCE CHARACTERISTICS: 

(A) -LENGTH : 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO:234: 

Ala Arg Thr Phe Arg Arg Glu Lys Asg Ala Giu Arg Leu Thr Ser Arg' 
5 10 15- 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu 
20 25 



INFORMATION FOR SEQ ID NO: 2 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
iO)' TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xx) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
1 ' 5 ' 10 15- 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin 
20 25 



(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly 
1- 5 10 15 

He Arg Gin His Leu Lys Arg Val Gin Leu Arg Asp Val Ser 
20 2 5 30 



(2) INFORMATION FOR SEQ ID NO:237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

( B ) TYPE : amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO:237: 

Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro Lys Pro Asp Gly 
1 5 10 '■ .15 

Leu Ara 'Pro lie Val Asn Met: Asp Tyr Val Val 

20 25 " - 



(2) INFORMATION FOR SEQ ID MO: 233: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 
■:C) STRANDEDNESS: single 
TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 
YARACKAARG GH AT YCCHYA RGG 



(2} INFORMATION EfOR SEQ I D MO ; 2 3 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 amino acids 
(3) TYPE: amino, acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:239: 

Gin Thr Lys Gly lie Pro Gin Gly 
1 5 
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(2) INFORMATION FOR SEQ ID MO: 240: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:240: 

Asp Asp Tyr Leu Leu lie Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 24 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
( 0} TOPOLOGY: linear 

MOLECULE TYPE: DMA 



(:<l) SEQUENCE DESCRIPTION: SEQ ID NO:241: 

ngtnatdard arrtartcrt c 



(2) INFORMATION FOR SEQ ID NO: 24 2: 

{I; SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 55 amino acids 
■ (E) TYPE; amino acic 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

Lvs Glv lie Pro Gin Gly Leu Cys 
f * 5 

Tyr Tvr Ala Thr Leu Glu Glu Ser 
20 

Ser Met Asn Pro Glu Asn Pro Asn 
35 40 



Val Ser Ser lie Leu Ser Ser Phe 
10 15 

~^r Leu Gly Phe Leu Arg Asp Glu 
25 30 



Val Asn Leu Leu Met Arg Leu Thr 
4 5 



Asp Asp 
50 



Tyr Leu Leu lie Thr 
55 
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(2) I N FORMAT ION TOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 
- (A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

Ser lie Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu Asp Leu lie 
1 5 10 15 

Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys Gly Ser Val Leu Leu Arg 
20 25 30 

Val Val 



(2) INFORMATION FOR SEQ ID NO : 2 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
(3)- TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:244: 

Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro lie Val Asp Leu 
1 * 5 . 10 15 

V-al Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys Ala Ser Pro 
■2 0~ 2 5 30 

Ser Gin Asp Thr Leu lie Leu Lys Leu Ala Asp Asp Phe Leu He He 
35" 40 ' 45 

Ser 



(2) INFORMATION FOR SEQ ID NO : 2 4 5 : 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 8 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 5: 

Gin Lys Val Gly lie Pro Gin Gly 
.1 5 

.(2) INFORMATION FOR SEQ ID NO:246: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base. pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:246: 
CAAAAAGTTG GTATCCCTCA GGG 23 

(2) INFORMATION FOR SEQ ID NO: 24 7; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 247: 

CAAAGG AATTCCATCA GGCTCAATTC TGTCATCTTT TTTGTGTCAT TTCTATATGG ^ 60 

AAGATTTGAT TGATGAATAC CTATCGTTTA CGAAAAAGAA AGGATCAGTG TTGTTACCAG 120 
TAGTCGACGA CTACCTCCTC ATCACC 146 

(2) INFORMATION FOR SEQ' ID^NO:243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 248: 

Lys Gly lie Pro Ser Glv Ser He Leu Ser Ser Phe Leu Cys His Phe 
15 10 15 

Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe Thr Lys Lys Lys 
20 25 30 
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Gly Ser Val Leu Leu Arg Val Val Asp Asp Tyr Leu , Leu He Thr 
35 40 45 



(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:249: 
GACGATTTCC TCTTTATAAC A 



(2) INFORMATION FOR SEQ ID .NO: 250: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRAMDEDMESS : 

(D) TOPOLOGY: linear 



(ii) ' MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

Aso Aso Phe Leu Phe lie Thr 
1 5 



■2'} INFORMATION FOR SEQ ID NO : 2 5 L : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE-: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. .35 

(D) OTHER INFORMATION: /note- "motif 0 peptide from 

Schizosaccharomyces pombe tezlp" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 5 1 : 

Trp Leu Tyr Asn Ser Phe lie lie Pro lie Leu Gin Ser Phe Phe Tyr 
1 5 '10 15 
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He Thr Glu Ser Ser Aso Leu Arg Asn Arg Thr Val Tyr Phe Arg Lys 
20 " 25 30 

Asp He Trp 
35 



INFORMATION FOR SEQ 10 NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: l/.22 

(0) OTHER INFORMATION: /note- "motif 1_ and 2 peptide from 

Schizosaccharomyces pombe tezlp" 



[:<i) SEQUENCE DESCRIPTION: SEQ ID MO:252: 

Ala Val He Arg Leu Leu Pro Lys Lys Asn Thr Phe Arg Leu He Thr 
L 5 10 15 

Asn Leu Arg Lys Arg Phe 
20 



INFORMATION FOR SEQ 10 NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

iA) LENGTH: 26 amino acids 
{3) TYPE: amino acid" 
(C) STRANDEDNESS : 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(!:<;} FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION : 1 . . 26 

(D) OTHER INFORMATION: /note- "motif 3(A) peptide from 

Schizosaccharomyces pombe tezlp' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:253: 

Lys Lvs Tyr Phe Val Arg He Asp lie Lys Ser Cys Tyr Asp Arg He 
1 5 10 15 

Lys Gin Asp Leu Met Phe Arg He Val Lys 
20 25 
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(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(i:<) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1 . . 32 

(D) OTHER INFORMATION: /note-' "motif 4(B'} peptide from 

Schizosaccharomyces pombe tezlp" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

Tyr Leu Gin Lys Val Giy He Pro Gin Gly Ser He Leu Ser Ser Phe 
1 5 10 15 

Leu Cy's His Phe Tyr Met Glu Asp Leu He Asp Glu Tyr Leu Ser Phe 

20 25 - - 30 



(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 4 9 amino acids 
(3) TYPE: amino acid 
(C) ST RAND EDM ESS : 
{0} TOPOLOGY : linear 

{ i i ) MOLECULE- TYPE : peptide 



(ix)- FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION: 1 . .49 

{ o ) OTHER INFORMATION: /no:^ "motif 5(C) and 6(D) peptide from 

Schizosaccharomyces pombe tezlp" 



[xi) SEQUENCE DESCRIPTION: SEQ ID NO:255: 

Val L^u Leu Arg Val Val Aso Asp Phe Leu Phe He Thr Val Asn Lys 

15 10 15 

Lys Aso Aia Lys Lys Phe Leu Asn Leu Ser Leu Arg Gly Phe Glu Lys 

20 25 30 

His Asn Phe Ser Thr Ser Leu Giu Lys Thr Val He Asn Phe Glu Asn 

35 40 ^ 45 



Ser 
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INFORMATION FOR SEQ ID MO : 2 5 6 ; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: amino acid 
"(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..34 

(D) OTHER INFORMATION: /note- "motif 0. peptide from ^ 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:256: 

Trp L-u Phe Arg Gin Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr 
1 5 -10 15 

Cys Tb- Giu lie Ser Ser Thr Val Thr lie Val Tyr Phe Arg His Asp 
20 25 30 

Thr Tro 



) INFORMATION FOR SEQ ID MO: 257: 

(i) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 25 amino acids 

(3) TYPE: amino acid 

(C) STRANDEDNESS : 

{ 0 ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:-:) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION : 1 . . 25 

(D) OTHER INFORMATION: /note- "motif 1 and 2 peptide rrom 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION.: SEQ ID NO: 257: 

Ser Lvs Met Arg lie lie Pro Lys Lys Ser Asn Asn Giu Phe Arg lie 

10 15 



1 5 



lie Ala lie Pro Cys Arg Giy Ala Asp 
20 25 



176 



INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1..26 

(0) OTHER INFORMATION: /note- "motif 3(A) peptide trom 

Saccharomyces cerevisiae EST2p" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:258: 

Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser lie 
1 5 10 13 

Pr 0 -r Q M-r Glu Cys Met Arg He Leu Lys 
20 25 



INFORMATION FOR SEQ 10 NO: 25 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 32- amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<:) FFATURc: 

(A) NAME/ KEY: Peptide 

(3) "LOCATION ; ' 1 . . 32 . . 

(D) OTHER INFORMATION: /note= "motif 4(B') peptide rrcrr. 

Saccharomyces cerevisiae EST2p" 



SEQUENCE DESCRIPTION: SEQ ID NO: 259: 
Tvr lie Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro 

r s 10 

lie Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe 
20 



25 30 



) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: 1. . 4 9 

(D) OTHER INFORMATION: /note= "motif 5(C) peptide from 

Saccharomyces cerevisiae EST2p" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 60 : 

Leu lie Lea Lys Leu Ala Asp Asp Phe Leu lie lie Ser Thr Asp Gin 
15 10 is" 

Gin Gin Val He Asn-Ile Lys Lys Leu Ala Met Gly Gly.Phe Gin Lys 
20 25 ' 30 

Tyr Asn Ala Lys Ala Asn Arg Asp Lys He Leu Ala Val Ser Ser Gin 
35 40 45 

Ser 



2} INFORMATION FOR SEQ 10 NO : 2 6 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(3) TYPE: amino acid 

JO STRANDEOMESS : 

{ D) TOPOLOGY: linear 

:li) XHECULE TYPE: peptide 



( A ) M AME/KEY : Peptide 
; 5) LOCATION ; 1 . . 35 

;0) OTHER INFORMATION: /no-ts = "motif 0 peptide from 

Euploces aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 6 1 : 

Trp lie Phe Glu Asp Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr 
1 5 10 15 

Val Thr Glu Gin Gin Lys Ser Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys 
20 25 .30 

Asn He Trp 
35 



2) INFORMATION FOR SEQ ID MO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 
(3) TYPE: amino acid 

(C) ST RAND EDM ESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 . .23 

(D) OTHER INFORMATION: /note* "motif 1 and 2 peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:262: 

Giy Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met 
1 5 10 15 

Thr Phe Asn Lys Lys lie Val 

20 ' 



(2) INFORMATION FOR SEQ I D MO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 "amino acids 

(B) TYPE : amino acid 

(C) ST RAND EDM ESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1.".26 

(0) OTHER INFORMATION: /note= "motif 3(A) peptide fro:?. 

Etiolates aediculatus pi 23" 



(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 263: ' 

Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser Val 
1* 5 10 15 

Asn Arg Glu Lys Leu Ser Thr Phe Lea Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 264: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: .1. .32 

(D) OTH.ER IN FORMAT ION : /note= "motif 4(B f ) peptide from. 

Euplotes aediculatus p!23 n 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 264: 

Tyr Lys Gin Thr Lys Gly lie Pro Gin Gly Leu Cys Val Ser Ser He 
1 5 . 10 15 

Leu Ser Ser Phe Tyr Tyr Ala'Thr Leu Glu Glu Ser Ser Leu Gly Phe 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 265: 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4 9 amino aoids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) KOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION : 1 . . 49 

(D) OTHER INFORMATION: '/note- "motif 5(C) and 6(0) peptide from 

Euolo ces aediculatus pi23" 



(:-;L) SEQUENCE DESCRIPTION: SEQ ID NO: 265 : 

Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu He Thr Thr Gin Glu 

1 5 ' 10 15 

Asn Asn Ala Val Leu Phe He Glu Lys Leu He Asn val Ser Arc Glu 
20 25 30 

Asn Gly Phe Lys Phe Asn Met Lys Lys Leu Gin Thr Ser Phe Pro Leu 
3 5 4 0" 45 

Ser 



(2) INFORMATION FOR SEQ ID NO : 2 6 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2171 base pairs- 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
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(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 22 . . 1716 

(D) OTHER INFORMATION: /note= "EcoRI-NotI insert of 

clone 712562 encoding 63 kDa 
telomerase protein" 

<xi)' SEQUENCE DESCRIPTION: SEQ ID NO:266: 

GCCAAGTTCC TGCACTGGCT GATGAGTGTG TACGTCGTCG AGCTGCTCAG GTCTTTCTTT . 60 

TATGTCACGG AGACCACGTT TCAAAAGAAC AGGCTCTTTT TCTACCGGAA GAGTGTCTGG 120 

AGCAAGTTGC AAA.GCATTGG AATCAGACAG CACTTGAAGA GGGTGCAGCT GCGGGAGCTG 180 

TCGGAAGCAG AGGTCAGGCA GCATCGGGAA GCCAGGCCCG CCCTGCTGAC GTCCAGACTC 240 

CGCTTCATCC CCAAGCCTGA CGGGCTGCGG CCGATTGTGA' ACATGGACTA CGTCG'TGGGA 300 

GCCAGAACGT TCCGCAGAGA AAAGARGGCC GAGCGTCTCA CCTCGAGGGT GAAGGCACTG 3 60 

TTCAGCGTGC TCAACTACGA GCGGGCGCGG CGCCCCGGCC TCCTGGGCGC CTCTGTGCTG 420 

GGCCTGGACG ATATCCACAG GGCCTGGCGC ACCTTCGTGC TGCGTGTGCG -GGCCCAGGAC 4 80 

CCGCCGCCTG AGCTGT ACTT TGTCAAGGTG GATGTGACGG GCGCGTACGA CACCATCCCC 540 

CAGGACAGGC TCACGGAGGT CATCGCCAGC ATCATCAAAC CCCAGAACAC GTACTGCGTG 600 

CGTCGGTATG CCGTGGTCCA GAAGGCCGCC ATGGGCACGT CCGCAAGGCC TTCAA.GAGCC 6 60 

ACGTCCTACG TCCAGTGCCA GGGGATCCCG CAGGGCTCCA TCCTCTCCAC GCTGCTCTGC 7 20 

A "TCTGTGCT ACGGCGACAT GGAGAACAAC CTGTTTGCGG GGATTCGGCG GGACGGGCTG 7 30 

T:::TGCGTT TGGTGGATGA TTTCTTGTTG GTGACACCTC ACCTCACCCA CGCGAAAACC 8 40 

TTCCTCAGGA CCCTGGTCCG AGGTGTCCCT GAGTATGGCT GCGTGGTGAA CTTGCGGAAG 900 

ACAGTGGTGA ACTTCCCTGT AGAAGACGAG GCCCTGGGTG GCACGGCTTT TGTTCAGATG 9 60 

CCGGCCCACG GCCTATTCCC CTGGTGCGGC CTGCTGCTGG ATACCCGGAC CCTGGAGGTG 10 20 

CAGAGCGACT ACTCCAGCTA TGCCCGGACC TCCATCAGAG CCAGTCTCAC CTTCAACCGC 10 80 

GGCTTCAAGG CTGGGAGGAA CATGCGTCGC AAACTCTTTG GGGTCTTGCG GCTGAAGTGT 1140 

CACAGCCTGT TTCTGGATTT GCAGGTGAAC AGCCTCCAGA CGGTGTGCAC CAACATCTAC 12 00 

AAGATCCTCC TGCTGCAGGC GTACAGGTTT CACGCATGTG TGCTGCAGCT CCCATTTCAT 12 60 

CAGCAAGTTT GGAAGAACCC ACATTTTTCC TGCGCGTCAT CTCTGACACG GCTCCCTCTG 132 0 

CTACTCCATC CTGAAAGCCA AGAACGCAGG GATGTCGCTG GGGGCCAAGG GCGCCGCCGG 133 0 

CCCTCTGCCC TCCGAGGCCG TGCAGTGGCT GTGCCACCAA GCATTCCTGC TCAAGCTGAC 14 4 0 

TCGACACCGT GTCACCTACG TGCCACTCCT GGGGTCACTC AGGACAGCCC AGACGCAGCT 1500 

GAGTCGGAAG CTCCCGGGGA CGACGCTGAC TGCCCTGGAG GCCGCAGCCA ACCCGGCACT - 15 60 
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GCCCTCAGAC TTCAAGACCA TCCTGGACTG ATGGCCACCC GCCCACAGCC AGGCCGAGAG 1620 

CAGACACCAG CAGCCCTGTC ACGCCGGGCT TATACGTCCC AGGGAGGGAG GGGCGGCCCA 168 0 

CACCCAGGCC TGCACCGCTG GGAGTCTGAG GCCTGAGTGA GTGTTTGGCC GAGGCCTGCA 174 0 

TGTCCGGCTG AAGGCTGAGT GTCCGGCTGA GGCCTGAGCG AGTGTCCAGC CAAGGGCTGA 18 00 

GTGTCCAGCA CACCTGCGTT TTCACTTCCC CACAGGCTGG CGTTCGGTGC ACCCCAGGGC 18 60 
CAGGTTTTCC TCACCAGGAG CCCGGCTTCC ACTCCCCACA TAGGAATAGT CCATCCCCAG 1920 

ATTCGCCATT GTTCACCCTT CGCCCTGCCT "TCCTTTGCCT TCCACCCCCA CCATTCAGGT 198 0 

GGAGACCCTG AGAAGGACCC TGGGAGCTTT GGGAATTTGG AGTGACCAAA . GGTGTGCCCT 20 4 0 

GTAGACAGGC GAGGACCCTG CACCTGGATG GGGGTCCCTG TGGGTCAAAT TGGGGGGAGG 2100 

TGCTGTGGGA GTAAAATACT GAATATATGA GTTTTTCAGT TTTGGAAAAA AAAAAAAAAA 2160 

AAAAAAAAAA A 2171 

(2) IMFORMATIOt^f FOR S£Q 10 NO:267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 amino acids 
(3) TYPE: amino acid 

(C) STRAMDEDNESS : 

(D) TOPOLOGY; linear 

(11) MOLECULE TYPE: protein 



(L:-:) FEATURE : 

( A) NAME /KEY : Protein 
(3) LOCATION: 1 . . 564 

(0) OTHER " INFORMATION: /note- "63 kOa t e lone ra 39 protein 

encoded "by ORE of EcoRI-Notl' insert of 
clone 712562" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 267: 

Met Ser Val Tyr Val Val GIu Leu Leu Arg Ser Phe Phe Tyr Val Thr 
1 '5 10 15 

Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val 
20 25 30 

Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His Leu Lys Arg Val 
35 40 45 

Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala 
50 55 60 

Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro Asp 
65 70 75 80 

Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr 
85 90 95 
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Phe Arg Arg Glu 
100 

Leu Phe Ser Val 
115 

Gly Ala Ser Val 
130 

Phe Val Leu Arg 
145 

Val Lys Val Asp 



Leu Thr Glu Val 
180 

Val Arg. Arg Tyr 
195 

Arg Pro Ser Arg 
210 

Gly Ser lie Leu 
225 

Glu Asn Lys Leu 



Leu Vai Asp Asp 
260 

Thr Phe Leu Arg 
275 

Val Asn. Leu Arg 
2 90 

Leu Gly Giy Thr 
30 5 

Trp Cys Gly Leu 



Tyr Ser Ser Tyr 
340 

Arg Gly Phe Lys 
355 

Leu Arg ■ Leu Lys 
370 

Leu Gin Thr Val 
335 

Tyr Arg Phe His 



Lys Xaa Ala Glu 



Leu Asn Tyr Glu 
120 

Leu Gly Leu Asp 
135 

Val Arg Ala Gin 
150 

Val Thr Gly Ala 
165 

He Ala Ser He 



Ala Val Val Gin 
200 

Ala Thr Ser Tyr 
215 

Ser Thr Leu Leu 
"230 

Phe Ala Gly He 
245 

Phe Leu Leu Vai 



Thr Leu Val Arg 

. 230 

Lys . Thr Va 1 Va 1 
295 

Ala Phe Vai Gin 
310 

Leu Leu Asp Thr 
325 

Ala Arg Thr Ser 



Ala Gly Arg Asn 
360 

Cys His Ser Leu 
375 

Cys Thr Asn He 
390 

Ala Cys Val Leu 
405 



Arg Leu Thr Ser 
105 

Arg Ala Arg Arg 



Asp He His Arg 
140 

Asp Pro Pro Pro 
155 

Tyr Asp Thr He 
170 

He Lys Pro Gin 
185 

Lys Ala Ala Met 



Val Gin Cys Gin 
220 

Cvs Ser Leu _Cys 
235 

Arg Arg Asp Gly 
250 

Thr Pro His Leu 
265 

Gly Val Pro Glu 



Asn Phe Pro Vai 

300 

Met: Pro Ala His 
315 

Arg Thr Leu Giu 
330 

lie Arg Ala Ser 
345 . 

Met Arg Arg Lys 



Phe Leu Asp Leu 
330 

Tyr Lys He Leu 
395 

Gin Leu Pro Phe 
4 10- 



Arg Val Lys Ala 
110 

Pro Gly Leu Leu 
125 

Ala Trp Arg Thr 



Glu Leu Tyr Phe 
160 

Pro Gin Asp Arg 
175 

Asn Thr Tyr Cys 
190 

Giy Thr Ser Ala 
205 

Gly He Pro Gin 



Tyr Gly Asp Met 
240 

Leu Leu Leu Arg 
255 

Thr His Aia Lys 
270 

Tyr Gly Cys Val 
285 

Giu Asp Glu Ala 



Glv Leu Phe Pro 
320 

Val Gin Ser Asp 
335 

Leu Thr Phe Asn 
350 

Leu Phe Gly Val 
365 

Gin Val Asn Ser 



Leu Leu Gin Ala 
400 

His Gin Gin Val 
415 
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Trp Lys Asn Pro His Phe 
420 

Leu Leu Leu His Pro Glu 
435 

Gin Giy Arg Arg Arg Pro 
450 

Pro Pro Ser lie Pro Ala 
465 470 

Ala Thr Pro Gly Val Thr 
485 

Ala Pro Gly Asp Asp Ala 
500 

Thr Ala Leu Arg Leu Gin 
515 

Gin Pro Gly Arg Glu Gin 
530 

Thr Ser Gin Gly Gly Arg 
545 550 

Glu Ser Glu Ala 



IN FORMAT ION TOR SEQ ID MO: 2 68: 

(D sequence: characteristics.* 

(A) LENGTH: 22 amino acids 

{ 3 ) T Y ? E : amino a c i c 

[2) STRANDEDNE5S: 

(0) TOPOLOGY : linear 

(ii) MOLECULE: TYPE: peptide 



( ix) FEATURE : 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note s "motif 1 peptide from 

Eupioces aediculatus pI23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 68 : 

Leu Val Val Ser Leu He Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin 
I 5 10 ' 15 

Lys Ser Tyr Ser Lys Thr 
20 



Ser Cys Ala Ser Ser 
425 

Ser Gin Glu Arg Arg 
4 4J0 

Ser Ala Leu Arg Gly 
455 

Gin Ala Asp Ser Thr 
475 

Gin Asp Ser Pro Asp 
490 

Asp Cys Pro Gly Gly 
505 

Asp His Pro Gly Leu 
520 

Thr Pro Ala Ala Leu 
535 

Giy Gly Pro His Pro 
555 



Leu Thr Arg Leu Pro 
430 

Asp Val Ala Gly Gly 
445 

Arg Ala Val Ala Val 
460 

Pro Cys His Leu Arg 
480 

Ala Ala Glu Ser Glu 
4 95 

Arg Ser Gin Pro Gly 
510 

Met Ala Thr Arg Pro 
525 

Ser Arg Arg Ala Tyr 
540 

Gly Leu His Arg Trp 
560 
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(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION : 1 . . 30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:269: 

Lys Ser Leu Gly Phe Ala Pro Gly Lys Leu Arg Leu lie Pro Lys Lys 
I 5 . 10 15 

Thr Thr Phe Arg Pro lie Met Thr Phe Asn Lys Lys lie Val 
20 25 * 30 



(2) INFORMATION FOR SEQ ID MO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino -acids 
(3) TYPE: amino acid 
■;C) STRANDEDNESS : 
(3) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE': 

(A) NAME/KEY: Peptide 
(3) LOCATION : 1 . . 27 

; o ) OTHER INFORMATION: /note- "motif A peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ 10 MO: 270: 

Pro Lys Lea Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr Asp Ser 
1 5 10- 15 

Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys 
20 25 



(2) INFORMATION FOR SEQ ID NO: 27 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(.5) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. . 27 

(D) OTHER INFORMATION: /note* "motif B peptide from 

Euplotes aediculatus p!23 ,f 



(;<i) SEQUENCE DESCRIPTION: SEQ ID NO:271: 

Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly lie Pro Gin Giy Leu Cys 
1 5 10 15 

Val Ser Ser lie. Lea Ser Ser Phe Tyr Tyr Ala 
20 25 



INFORMATION FOR SEQ ID NO: 27 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(BJ- TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



[ix) FEATURE: 

(A) NAME /KEY: Peptide 
(3) LOCATION: 2 2 

(D) OTHER INFORMATION: /note= "motif C peptide from 

Euolotes aediculatus p!23" 



;>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 27 2: 

Pro Asn Vai Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu Leu lie 
i 5 10 15 

Thr Thr Gin Glu Asn Asn 
20 



INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH:- 15 amino- acids 

(B) TYPE: amino acid 
(O STRANDEDNESS: 

(DJ TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION : 1. . 15 

(D) OTHER INFORMATION: /note*: "motif D peptide from 

Euplotes aediculatus p!23" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:273: 

Asa Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 27 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino acid 
CC} ST RAND £0 NESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE : _ - 

(A) NAME /KEY : Peptide 
(3) LOCATION: i . .22 

(D) OTHER INFORMATION: /note* "motif i peptide from 

Schizosaccharomyces" pombe tezl' 



(xi) SEQUENCE DESCRIPTION: SZQ ID MO:274: 

Phe He He Pro He Leu Gin Ser ?he Phe Tyr He Thr Glu Ser Ser 
1 5 10 15 

Asp Leu Arg Asn Arg Thr 
20 



(2) INFORMATION FOR SEQ'lD NQ:275: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 

( B ) TYPE: amino acid 

(C) STRAMDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION : 1 . . 30 

(D) OTHER INFORMATION: /note= "motif 0 peptide from 

Schizosaccharomyces pombe tezl" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

Gin Lys Thr Thr Leu Pro Pro Ala Val lie Arg Leu Leu Pro Lys Lys 
1 5 10 15 

Asn Thr Phe Arg Leu lie Thr Asn Leu Arg Lys Arg Phe Leu 
20 . 25 30 



(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:, peptide 



<ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1. .27 

(D) OTHER INFORMATION: /note= "motif A peptide from 
; Schizosaccharomyces pombe tezl" 



(:<ij SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

Arq Lvs Lys Tvr Phe Val Arg lie Asp lie Lys Ser Cys Tyr Asp Arg 
1 5 * 10 15' 

lie Lys Gin Asp Leu Met Phe Arg He Val Lys 
20* 25 



[2] - INFORMATION FOR SEQ ID MO:277: 

(i) SEQUENCE CHARACTERISTICS: 

( A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Cix) FEATURE: 

(A) NAME /KEY : Peptide 
(3) LOCATION:' 1 . .27 

(D) OTHER INFORMATION: /note= "motif B peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

Gly Asn Ser Gin Tvr Leu Gin Lys Val Gly He Pro Gin Gly Ser He 
1.5' 10 15 



Leu Ser Ser Phe Leu Cys His Phe Tyr Met Glu 
■20 25 



188 



2) INFORMATION FOR SEQ ID NO: 278: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22. amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 
(3) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note- "motif C peptide from 

Schizosaccharomyces pombe tezl M 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:278: 

Lys Lys Gly Ser Val Leu Leu Arg Val Val Asp Asp Phe Leu Phe lie 
15 10 15 

Thr Val Asn Lys Lys Asp 
20 



2) INFORMATION EOR SEQ ID MO: 27 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(Q) TOPOLOGY: linear 

(Li) MOLECULE TYPE : peptide. 



(i:*:) FEATURE : 

(A) NAME /KEY: Peptide 
(5}- LOCATION: 1 . T15 ■ 

(D) OTHER INFORMATION: /note=» "motif D peptide from 

Schizosaccharomyces pombe tezl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: 

Leu Asn Lea Ser Leu Arg Gly Phe GIu Lys His Asn Phe Ser Thr 
15 10 15 



2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note- "motif 1 peptide from 

SaccharQmyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:280: 

Leu lie Pro Lys lie lie Gin Thr Phe Phe Tyr Cys Thr Glu lie Ser 
1 5 10 15 

Ser Thr Val Thr lie Val 
20 



(2) INFORMATION FOR SEQ ID NO: 281: ■ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 
CD) TOPOLOGY: linear 

(ii) tXOLECULE TYPE: peptide 



(ix) -FEATURE: 

{A) NAME/KEY: Peptide 
(3) LOCATION: 1. .32 

(D) OTHER INFORMATION: /note^ "motif 0 peptide from 

Saccha romyces cerevisiae EST 2 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

.Thr Leu Ser Asn ..Phe Asn His Ser Lys Met Ar.g He He Pro Lys Lys 
I 5 10 15 

Ser Asn Asn Glu Phe Arg He lie Ala He Pro Cys Arg Gly Ala Asp 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 23 2: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1. . 27 

(D) OTHER INFORMATION: /note= "motif A peptide from 

Saccharomyces cerevisiae EST2" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: 

Pro Glu Leu Tyr Phe Met Lys Phe Asp Val Lys Ser Cys Tyr Asp Ser 
1 5-10 15 

He Pro Arg Met Glu Cys Met Arg He Leu Lys 
20 2 5 



(2) INFORMATION FOR SEQ ID NO:283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY: Peptide 

(B) LOCATION: 1. .27 

(D) OTHER INFORMATION: /note- "motif B peptide from 

Saccharotnyces cerevisiae EST2' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233: . 

Glu Aso Lys Cys Tyr Tie Arc Glu Aso Giy Leu Phe Gin Gly Ser Ser 
1 5 ' '-10 15 

Leu Ser Ala Pro lie Val Asp Leu Val Tyr Asp 
20 25 



; 2 ) I M FORMAT i ON FOR SEQ ID NO : 2 3 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 
(3) T'fPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix} FEATURE : 

(A) - NAME /KEY : Peptide 

( B) LOCATION: 1 . . 2 2 

(D) OTHER INFORMATION: /note* "motif C peptide from 

Saccharomyces cerevisiae EST2 ' 



[kL] SEQUENCE DESCRIPTION: SEQ ID NO: 234: 

Ser Gin Asp Thr Leu lie Leu Lys Leu Ala Asp Asp Phe Leu lie lie 
1 - " 5 10 15 



Ser Thr Asp Gin Gin Gin 
20 
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(2) INFORMATION FOR SEQ ID MO: 235: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY :. Peptide 
(3) LOCATION: 1. . 15 

(D) OTHER INFORMATION: /note= "motif D peptide from. 

Saccharomyces cerevisiae EST2" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:285: 

Lys Lys Leu Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala 
1 5 10 "15 



.(2) INFORMATION FOR SEQ ID MO: 23 6: 

(i) SEQUENCE- CHARACTERISTICS : 

(A) LENGTH: 22 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide. 
(2) LOCATION: 1..2 2 

■(D) OTHER INFORMATION: /note- "motif I peptide from human 

telor.erase core protein L-(TCPI)" 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 236: 

Tyr Val Va 1 Glu Leu Leu Arg Ser Phe Phe Tyr Va 1 Thr Glu Thr Thr 
1 5 10 15 

Phe Gin Lys Asn Arg Leu 

. 20 



(2} INFORMATION FOR SEQ ID MO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/ KEY : Peptide 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note» "motif 0 peptide from human 

telomerase core protein 1 (TCP1)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 287: 

Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie Pro Lys Pro 
15 10 15 

Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly 
20 25 30 



(2) INFORMATION FOR SEQ ID NO: 28 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

CD) TOPOLOGY: linear 

( ii). MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) NAME/ KEY : Peptide 
(5) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note- 1 "motif A peptide from human 

ceiome rase core protein I (TCP!)" 



Cx.i) SEQUENCE DESCRIPTION: SEQ ID NO: 288: 

Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr- 
1 5 ■ 10 15 

lie Pro Gin Asp Arg Leu- Thr Glu Val lie Ala 
20* " ' 25 



(2) INFORMATION FOR SEQ ID NO: 289: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 
(3) TYPE: amino acid 
CO STRA.NDEDNESS : 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Protein 
(3) LOCATION: 1 . . 27 

"(D) OTHER INFORMATION: /note* "motif B peptide from human 

telomerase core protein 1 (TCP1)" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 9: 

Arg Ala Thr Ser Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser lie 
1 5 10 15 

Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly 
20 25 



INFORMATION FOR SEQ ID NO: 2 90 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 amino acids 

(B) TYPE: amino, acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY : Peptide 

(B) LOCATION: .22 

(0) OTHER INFORMATION: /note- "motif C peptide from hunan 

telomerase core protein 1 (TCPI) " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290: 

Arg Arg Asp Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val 
L * 5 10 15 

Thr Pro His Leu Thr His 
20 



I N FORMATION FOR SEQ ID NO : 2 9 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
■ (C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Peptide 
(3) LOCATION: 1 . . 15 

(D) OTHER INFORMATION: /note^ "motif D peptide from hurr.an 

telomerase core protein 1 (TCP!)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:291: 



Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO: 2 92 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4029 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..4029 

{ D) OTHER INFORMATION: /note- "preliminary sequence for 

human TRT cDNA insert of 
plasmid pGRN121" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCGATGCC 60 

GCGCGGTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC AGCCACTACC GCGAGGTGCT 120 

GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG GCCCCA^GGGC TGGCGGCTGG ' TGCAGCGCGG 18 0 

GGACCCGGCG GCTTTCCGCG CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN 24 0 

ANGGCNGCCC CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAAMG ANCTGGTGGC 30 0 

CCGAGTGCTG CAMAMGCTGT GCGAMCGCGG CGCGAAMAAC GTGCTGGCCT TCGGCTTCGC 360 

GCTGCTGGAC GGQGCCCGCG GGGGCCCCCC CGAGGCCTTC ACCACCAGCG TGCGCAGCTA 420 

CC7GCCCAAC ACGGTGACCG ACGC ACT GCG GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG ~480 

CCGCGTGGGC GACGACGTGC TGGTTCACCT GCTGGCACGC TGCGCGMTMT TTGTGCTGGT 54 0 

GGilTCCCAGC TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCMAC 600 

TCAGGCCCGG CCCCCGCCAC ACGCTAMTGG ACCCGAANGC GTCTGGGATC CAACGGGCCT 660 

GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG CCAGCCCCGG GTGCGAGGAG 72 0 

GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC 78 0 

CCCTGAGCCG GAGCGGACGC CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC 84 0 

TGGACCGAGT GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC "900 

CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG GCCGCCAGCA 960 

CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT GGGACACGCC TTGTCCCCCG 1020 

GTGTACGCCG AGACCAAGCA CTTCCTCTAC TCCTCAGGCG ACAAGNACAC TGCGMCCGTC 108 0 

CTTCCTACTC AATAT ATCTG AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA 114 0 

NTCTTTCTGG TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 1200 
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GCGMTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG CGCAGTGCCC 12 60 

CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT GCGGTCACCC CAGCAGCCGG 1320 

TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC TGTGGCGGGG GGCGAGGAGG AGGAACACAG 138 0 

ACCCCCGTCG CCTGGTGCAG CTGCTCCGCC AG C AC AG C AG CCCCTGGCAG GTGTACGGCT 14 4 0 

TCGTGCGGGC CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 1500 

AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT GCCAAGCTCT 15 60 

CGCTGCAGGA GCTGACGTGG AAGATGAGCG . TGCGGGACTG CGCTTGGCTG CGCAGGAGCC 1620 

CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC ACCGTCTGCG TGAGGAGATC CTGGCCAAGT .1680 

TCCTGCACTG GCTGATGAGT GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA " 174 0 

CGGAGACCAC GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 18 00 

TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG CTGTCGGAAG I8 60 

CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT GACGTCCAGA CTCCGCTTCA 1920 

TCCCCAAGCC TGACGGGCTG CGGCCGATTG TGAACATGGA CTACGTCGTG GGAGCCAGAA 198 0 

CGTTCCGCAG AGAAAAGAGG GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG 20 4 0 

TGCTCAACTA CGAGGGGGGG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 2 LOO 

ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG GACCCGCCGC 2160 

CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA CGACACCATC CCCCAGGAGA 22 2 0 

GGCTCACGGA GGTCATCGCC AGCATCATCA AACCCCAGAA CACGTACTGG G7GCGTCGG? 2230 

ATGCCGTGGT CCAGAAGGCC GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT 2340 

CTACCTTGAC- AGAGCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 24 00 

GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG GCCAGCAGTG 24 60 

GCCTCTTCGA GGTCTTCCTA CGCTTCATGT GCCAGCACGC CGTGCGCATG AGGGGCAAGT 25 20 

CCTACGTCCA GTGCCAGGGG ATCCCGCAGG GCTCCATCCT CTCCACGCTG CTCTGCAGCG 25 8 0 

TGTGCTACGG CGACATGGAG AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC 2 64 0 

TGCGTTTGGT GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 27 00 

TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAA.GTTG CGGAAGACAG 27 60 

TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC GGCTTTTGTT CAGATGCCGG 28 20 

CCCACGGCCT ATTCCCCTGG TGCGGCCTGC TGCTGGATAC CCGGACCCTG GAGGTGCAGA 28 8 0 

GCGACTACTC CAGCTATGCC CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT 29 4 0 

TCAAGGCTGG GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 30 0 0 

GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC ATCTACAAGA 30 60 
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TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT GCAGCTCCCA TTTCATCAGC 3120 

AAGTTTGGAA GAACCCCACA TTTTTCCTGC GCGTCATCTC TGACACGGCC TCCCTCTGCT 318 0 

ACTCCATCCT GAAAGCCAAG AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC 3 240. 

CTCTGCCCTC CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 3 300 

GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG ACGCAGCTGA 3 3 60 

GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC CGCAGCCAAC CCGGCACTGC 3 420 

CCTCAGACTT CAAGACCATC CTGGACTGAT GGCCACCCGC CCACAGCCAG GCCGAGAGCA 34 8 0 

GACACCAGCA GCCCTGTCAC GCCGGGCTCT ACGTCCCAGG GAGGGhGGGG CGGCCCACAC 3 54 0 

CCAGGCCCGC ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 3 600 

CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA GGGCTGAGTG 3 660 

TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG CTCGGCTCCA CCCCAGGGCC 3720 

AGCTTTTCCT CACCAGGAGC CCGGCTTCCA CTCCCCACAT AGGAATAGTG CATCCCCAGA 3730 

■TTCGCCATTG TTCAGGCCTC GCCCTGCCGT CCTTTGCCTT CCACCCCCAC CATCCAGGTG -3840 

GAGACCCTGA GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 3 900 

TACACAGGCG AGG AC CCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT GGGGGGAGGT ~ 3 960 

GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT TTGAAAAAAA AAAAAAAAAA 4 020 

AAAAAAAAA. 4 02 9 



I i-i FORMAT ro:: TOR SZQ ID NO : 2 93: 

(i) SEQUENCE CHA?.ACTE£IS?ICS : 

(A) LENGTH : 261 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293: 

Aia Ala Leu Arg Pro Ala Ala His Val Gly Ser Pro Giy Pro Gly His 

15 10 15 

Pro Arq Asp Ala Ala Arg Ser Pro Leu Pro Ser Arg Ala Leu Pro Ala 
20 25 30 

Ala Gin Pro Leu Pro Arg Giy Ala Ala Ala Gly His Val Arg Aia Ala 
35 40 45 

Pro Gly Ala "Pro Gly Leu Ala Ala Giy Ala Ala Arg Gly Pro Gly Gly 
50 55 60 
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Phe Pro Arg Xaa Gly Gly Pro Xaa 

65 70 

Xaa Ala Ala Pro Arg Arg Pro Leu 
85 

Xaa Xaa Gly Gly Pro Ser Ala Ala 
100 



Xaa Gly Val Arg Ala Leu Gly Xaa 
75 80 

Leu Pro Pro Gly Val Leu Pro Glu 
90 " 95 

Xaa Ala Val Arg Xaa Arg Arg Glu 
105 110 



Xaa Arg Ala Gly Leu Arg Leu Arg Ala Ala Gly Arg Gly Pro Arg Gly 
115 120 125 

Pro Pro Arg Gly Leu His His Gin Arg Ala Gin Leu Pro Ala Gin His 
130 135 140 

Gly Asp Arg Arg Thr Ala Gly Glu Arg Gly Val Gly Ala Ala Ala Ala 
145 150 155 160 

Pro Arg Gly Arg Arg Arg Ala Gly Ser Pro Ala- Gly Thr Leu Arg Xaa 
165 170 175 

Xaa Cys Ala Gly Gly Ser Gin Leu Arg Leu Pro Xaa Val Arg Ala Ala 
130 135 . 190 

Ala Val Pro Aia Arg Arg Cys Xaa Ser Gly Pro Ala Pro Ala Thr Arg 
195 200 . ' 205 

Xaa Trp Thr Arg Xaa Arg Leu Gly Ser Asn Gly Pro Gly Thr lie Ala 
" 210 215 220 

Ser Gly Arg Pro Gly Ser Pro Trp Ala Ala Ser Pro Giy Cys. Glu Glu 
225 230 235 240 

Aia Arg Giy Gin Cys Gin Pro Lys Ser Ala Val Ala Gin- Glu Ala Gin 
245 250^ 255 

Ala Trp Arg "Cys Pro 
- 260 



INFORMATION FOP SEQ ID NO:294: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMSSS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 294: 

Ala Giy Ala Asp Ala Arg Trp Ala Gly Val Leu Gly Pro Pro Gly Gin 
1 5 10 1.5 



Asp Ala Trp Thr Glu 
20 
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(2) INFORMATION FOR SEQ ID MO: 295: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH; 82 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

Pro Trp Phe Leu Cys Gly Val Thr Cys Gin Thr Arg Arg Arg Ser His- 

1 5 10 15 

Leu Phe Gly Gly Cys Ala Leu Trp His Ala Pro Leu Pro Pro lie Arg 
20 25 30 

Gly Pro Pro Ala Pro Arg Gly Pro Pro He His He Ala Ala Thr Thr 
35 40 45 

Ser Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe 
50 " : 55 60 

Leu Tyr Ser Ser Gly Asp Lys Xaa Th-r Ala Xaa Leu Leu Pro Thr Gin 
65 - 70 ■ 75 80 

Tyr He 



(2) INFORMATION FOR SEQ ID MO:296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 153 amino acids 

(B) TYPE: amino acid 

(C) 5TRAMDEDMESS : 

(0) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 296: 

Leu Ala Phe Gly Arg Phe Val Glu Thr Xaa Phe Leu Val Pro Gly Leu 

15 10 15 

Gly Cys Gin Asp Ser Pro Gin Val Ala Pro Pro Ala Pro Ala Xaa Leu 
20 25 30 

Ala Asn Ala Ala Pro Val Ser Gly Ala Ala Trp Glu Pro Arg Ala Val 
35 40' 45 

Pro Leu Arg Gly Val Pro Gin Asp Ala Leu Pro Ala Aia Ser Cys Gly 
50 55 60 

His Pro Ser Ser Arg Cys Leu Cys Pro Gly Glu Ala Pro Gly Leu Cys 
65 70 * 75 80 

Gly Gly Pro Arg Gly Gly Gly Thr Gin Thr Pro Val Ala Trp Cys . Ser 

85 90 95 
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Cys Ser Ala Ser Thr Ala Ala Pro Gly Arg Cys Thr Ala Ser Cys Gly 
100 105 HO 

Pro Ala Cys Ala Gly Trp Cys Pro Gin Ala Ser Gly Ala Pro Gly Thr 
115 120 125- 

Thr Asn.Ala Ala Ser Ser Gly Thr Pro Arg Ser Ser Ser Pro Trp Gly 
130 135 140 

Ser Met Pro Ser Ser Arg Cys Arg Ser 
145 150 



INFORMATION FOR SEQ ID NO:297: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE : amino acid 

(C) ST HANDEDNESS : ' 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:297: 

Ala Cys Glv Thr Ala Leu Gly Cys Ala Gly Ala Gin Gly Leu Ala Val 
I 5 - 10 15 

Phe Arg Pro Gin Ser Thr Val Cys Val Arg Arg Ser Trp Pro Ser Ser 
20 25 30 . 

Cys Thr Gly 
3 5 



INFORMATION FOR SEQ ID NO: 2 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 

(B) TYPE: amino acid 
.(C) ST RAND EDM ESS : 

{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 298: 

Val Cys Thr Ser Ser Ser Cys Ser Gly Leu Ser Phe Met Ser Arg Arg 
15 10 15 

Pro Arg Phe Lys Arg Thr Gly Ser Phe Ser Thr Gly Arg Val Ser Gly 
20 25 30 

Ala Ser Cys Lys Ala Leu Glu Ser Asp Ser Thr 
35 40 



200 



(2) INFORMATION FOR SEQ ID MO: 2 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

( B ) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299: 

Arg Gly Cys Ser Cys Giy Ser Cys Arg Lys Gin Arg Ser Gly Ser lie 
1 5 10 15 

Gly Lys Pro Gly Pro Pro Cys 
20 



(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(S) TYPE: amino acid 
(C) STRANDEDNESS: 
(-D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ 10 MO:300: 

Arg Pro Aso Ser Ala Ser Ser Pro Ser Leu Thr Gly Cys Gly Arg Leu 
1 ^ 5 10 15 



[2) INFORMATION FOR SEQ ID NO: 301: 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 23 amino acids 
(3) TYPE: amino acid' 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 301: 

Thr Trp Thr Thr Ser Trp Glu Pro Glu Arg Ser Ala Glu Lys Arg Gly 
1 5 10 15 

Pro Ser Val Ser Pro Arg Gly ■ 
20 
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(2) I N FORMAT ION FOR SEQ ID MO: 302: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:302: 

Arg His Cys Ser Ala Cys Ser Thr Thr Ser Gly Arg Gly Ala Pro Ala 

1 5 10 15 

Ser Trp Ala Pro Leu Cys Trp Ala Trp Thr He Ser Thr Gly Pro Gly 
20 25 30 

Ala Pro Ser Cys Cys Val Cys Gly Pro Arg Thr Arg Arg Leu Ser Cys 
35 40 45 

Thr Leu Ser Arg Trp Met 

50 " 



(2) INFORMATION FOR SEQ ID MO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



SEQUENCE ' DESCRIPTION : SEQ IDMO:303: " 

Arg Ala Arg Thr Thr Pro Ser Pro Arg Thr Gly Ser Arg Arg Ser Ser 
I ' 5 10 15 

Pro Ala Ser Ser Asn Pro Arg Thr Arg Thr Ala Cys Val Gly Ms. Pro 
20 25 30 

Trp Ser Arg Arg Pro Pro Met Gly Thr Ser Ala Arg Pro Ser Arg Ala 
35 40 45 

Thr Ser Leu Pro 
50 



(2) INFORMATION FOR SEQ ID MO: 304: ' 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 
.(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: 

Gin Thr Ser Ser Arg Thr Cys Asp Ser Ser Trp Leu Thr Cys Arg Xaa 
15 10 15 

Thr Ala Arg 



(2) INFORMATION FOR SEQ ID NO: 30 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: . 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:305: 

Gly Met Pro Ser Ser Ser Ser. Arg Ala Pro Pro 
L ■ 5 10 



(2) INFORMATION FOR SEQ ID NO: 30 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 amino acids 
(3} TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

Met Arg Pro Ala Val Ala Ser Ser Th-r Ser Ser Tyr Ala Ser Cys Ala 
! 5 10 ' 15 

Thr Thr Pro Cys Ala Ser Gly Ala Ser Pro Thr Ser- Ser Ala Arg Gly 
20 25 30 

Ser Arg Arg Ala Pro Ser Ser Pro Arg Cys Ser Ala Ala Cys Ala Thr 
35 . 40 45 

Ala Thr Trp Arg Thr Ser Cys Leu Arg Gly Phe Gly Gly Thr Gly Cys 
50 55 60 

Ser Cys Val Trp Trp Net He Ser Cys Trp 
65 70 



(2) INFORMATION FOR SEQ ID MO: 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : " 2 4 amino acids 

(B) TYPE: amino _acid 

(C) STRANDEDNESS: 

( D } TOPOLOGY: linear 



203 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 307: 

His Leu Thr Ser Pro Thr Arg' Lys Pro Ser' Ser Gly Pro Trp Ser Glu 
1 5 10 15 

Val Ser Leu Ser Met Ala Ala Trp 
20 



INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(xi) SEQUENCE DESCRIPTION: SEQ- I D .NO: 308 

Thr Cys Gly Arg Gin Trp 
1 5 



IM FORMAT ION FOR SEQ ID NO: 30 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 
(3) TYPE: amino acid 
:C) STRANDEDNESS: 
[D) TOPOLOGY: linear 

;i.i) XOLECUIE TYPE: ceotice 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 

Lys Thr Arg Pro Trp Val Ala Arg Leu Leu Phe Arg Cys Arg Pro Thr 
L ' ' 5 10 15 " 

Ala Tyr Ser Pro Gly Ala Ala Cys Cys Trp He Pro Gly Pro Trp Arg 
20 25 30 

Cys Arg Ala Thr Thr Pro Ala Met Pro Gly Pro Pro Ser Glu Pro Val 
35 40 45 

Ser Pro Ser Thr Ala Ala Ser Arg Leu Gly Gly Thr Cys Val Ala Asn 
50 55 60 

Ser Leu Gly Ser Cys Gly 
65 70 
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(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

Ser Val Thr Ala Cys Phe Trp lie Cys Arg 
1 5' 10 



(2) INFORMATION FOR SEQ ID NO : 3 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 55 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

CD} TOPOLOGY: linear 

{ii) MOLECULE TYPE": peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

Thr Ala Ser Arg Arg Cys Ala Pro Thr Ser Thr Arg Ser Ser Cys Cys 
1 5 10 15 

Arg Arg Thr Gly Phe Thr His Val Cys Cys Ser Ser His Phe He Ser 
20 25 3.0 

Lys Phe Gly Arg Thr Pro His Phe Ser Cys Ala Ser Ser Leu Thr Arg 
3 5 <\ 0 " <; 5 

Pro Pro Ser Ala Thr Pro Ser 
50 55 - 



INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 34 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312: 

Lys Pro Arg Thr Gin Gly Cys Arg Trp Gly Pro Arg Ala Pro Pro Ala 
1 5 • 10 15 
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Leu Cys Pro Pro Arg Pro Cys Ser Gly Cys Ala Thr Lys His Ser Cys 
20 25 30 

Ser Ser 



(2) INFORMATION FOR. SEQ ID NO:313: 

■ (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:313: - 
Leu Aso Thr Val Ser Pro Thr Cys His Ser Trp Gly His Ser Gly Gin 

Pro Arg Arg Ser 
20 



(2) I M FORMAT I ON FOR SEQ ID NO : 3 1'4 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 amino acids 
(3) TYPE: amino acid 
(C) STRAMDEDMESS : 
(O; TOPOLOGY: linear 

(ii) XOLECu'LE TYPE: peptide 



SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

Val Gly -Ser Ser Arg Gly Arg Arg 
1 5 



(2)- INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 61 amino acids 
(8) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 

Leu Pro Trp Arg Pro. Gin Pro Thr Arg His Cys Pro GLn Thr Ser Arg 
I " 5 10 15 

p^o- Ser Trp Thr Asp Glv His Pro Pro Thr' Ala Arg Pro Arg Ala Asp 
20 * 25 - 30 
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Thr Ser Ser Pro Val Thr Pro Gly Ser Thr Ser Gin Gly Gly Arg Gly 

35 40 45 

Gly Pro His Pro Giy Pro His Arg Trp Glu Ser Glu Ala 

50 " 55 60 



(2) INFORMATION FOR SEQ ID MO : 316: 

(iy SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 
(BJ TYPE: .amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 6 : 

Val Ser Val Trp Pro Arg Pro Ala Cys Pro Ala Glu Gly 
1 5 ~ 10 



INFORMATION FOR SEQ 10 NO: 317: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(:•:!) SEQUENCE DESCRIPTION: SEQ ID MO: 317: 

Gly Leu Ser Glu Cys Pre Ala Lys Gly 
1 5 



INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 ammo acids 
(8) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



[xi) SEQUENCE DESCRIPTION: SEQ ID MO: 318: 

Val Ser Ser Thr Pro Ala Val Phe Thr Ser Pro Gin Ala Gly Ala Arg 
15 10 15 

Leu His Pro Arg Ala Ser Phe Ser Ser Pro Gly Ala Arg Leu Pro Leu 
20 25 30 

"Pro Thr 
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INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS : 

CA) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:319: . 

Ser He Pro Arg Phe Ala He Val His Pro Ser Pro Cys Pro Pro Leu 
1 5 10 15 

Pro Ser Thr Pro Thr He Gin Val Glu Thr Leu Arg Arg Thr Leu Gly 
. 20 25 30 

Ala Leu Gly He Trp Ser Asp Gin 'Arg Cys Ala Leu. Tyr Thr Gly Glu 
35 40 45 

Asp Pro Ala Pro Gly Trp Gly Ser Leu Trp Val Lys Leu Gly Gly' Gly 
50 55 60 

Ala Val Gly Val Lys Tyr 
65 70 



INFORMATION FOR SEQ ID NO: 320: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 
(3) TYPE: amino acid 
(C) ST HANDEDNESS : 
(0) TOPOLOGY : linear 

Hi) MOLECULE -TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:320: 

He Tyr Glu Phe Phe Ser Phe Glu Lys. Lys Lys Lys Lys Lys Lys Lys 
15 10 15 



INFORMATION EOR SEQ ID MO: 321: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 222 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:321: 

Gin Arg Cys Val Leu Leu Arg Thr Trp Glu Ala Leu Ala Pro Ala Thr 
1 5 10 lo 
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Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu 
20 25 30 

Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg 
35 ' 40 45 

Leu Gly Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala 
50 ' 55 60 

Phe Arg Ala Xaa Val Ala Xaa Cys Xaa Val Cys Val Pro Trp Xaa Xaa 
65 70 75 80 

Xaa Xaa Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Xaa 
85 90 95 

Xaa Leu Val Ala Arg Val Leu Xaa Xaa Leu Cys Xaa Arg Gly Ala Xaa 
100 105 110 

Asn Val Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly 
115 120 125 

Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr 
130 135 140 

Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg 
145 150 " 155 160 

Arg Val Gly Asp Asp Val Leu- Val His Leu Leu Ala Arg Cys Ala Xaa 
165 170 175 

Phe Val Leu Val Xaa Pro Ser Cys Ala Tyr Xaa Val Cys Gly Pro Pro 
130 135 190 

Leu Tyr Gin Leu Gl-y Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala 
" 195 200 205 

Xaa. Gly Pro Glu Xaa Val Trp Asp Pro Thr Gly Leu Glu Pro 
2l5 215 V 220 



1 N FORMAT I GM EOR SEQ ID MO: 322: 

(ij SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 330 amino acids 
(3) TYPE : amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY : linear 



(ii) MOLECULE TYPE : peptide 



(:<i) SEQUENCE DESCRIPTION: SEQ ID MO: 322: 

Arg Gin Gly Gly Arg Gly Pro Pro Gly Leu Pro Ala Pro Gly Ala Arg 
i 5 10 15 

Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro 
20 25 30 



Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly 
3 5~ 4 0 4 5 
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Ser Trp Ala His Pro Gly Arg Thr Pro Gly Pro Ser Asp Arg Gly Phe 
50 55 60 

Cys Val-'Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu 
65 70 75 80 

Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg. Gin 
85 90 95 

His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Gly Thr 
100 105 110 

Arg Leu Val Pro Arg Cys Thr Pro Arg Pro Ser Thr Ser Ser Thr Pro 
115 120 125 

Gin Ala Thr Xaa Thr Leu Arg Pro Ser Phe Leu Leu Asn lie Ser Glu 
130 " 135 140 

Ala Gin Pro Asp Trp Arg Ser Gly Gly Ser Trp Arg Xaa Ser Phe Trp 
145 150 155 160 

Phe Gin Ala Leu Asp Ala Arg lie Pro Arg Arg Leu Pro Arg Leu Pro 
165 170 175 

Gin Arg Tyr Trp Gin Met Arg Pro Leu Phe 'Leu Glu Leu Leu Gly Asn 
18'0 135 " 190 

His Ala Gin Cys Pro Tyr Gly Val Phe Leu Lys Thr H"is Cys Pro Leu 
195 200 205 

Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro 
210 . 215 220 

Gin Gly Ser Val Ala Ala Pro Glu Glu Glu Glu His Arg Pro Pro Ser 
225 ^ 230 235 240 

Pro Gly Ala Ala Ala Pro Pro Ala Gin Gin Pro Leu 'Ala Gly Val Arg 
245 250 255 

Leu Arg Ala Gly Leu Pro Ala Pro Ala Gly Ala Pro Arg Pro Leu Gly 
260 265 270 

Leu Gin Ala Gin Arg Thr Pro Leu Pro Gin Glu His Gin Glu Val His 
275 230 235 

Leu Pro Gly Glu Ala Cys Gin Ala Leu Ala Ala Gly Aia Asp Val Glu 
290 295 300 

As d Glu Arg Ala Gly Leu Arg Leu Ala Ala Gin Glu Pro Arg Gly Trp 
305 310 315 320 



Leu Cys Ser Gly Arg Arg Ala Pro Ser Ala 
' 325 330 
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INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS : 
- (A) LENGTH: 89 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(0) .TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Gly Aso Pro Gly Gin Val Pro Ala Leu Ala Asp Glu Cys Vai Arg Arg 
1 5 10 15 

Arg Aia Ala Gin Val Phe Leu Leu Cys His Gly Asp His Vai Ser Lys 
20 25 30 



Glu Gin Ala Leu Phe Leu Pro Glu Glu Cys Leu Glu Gin Val Ala Lys 

35 40 . 45 

His Trp Asn Gin Thr Ala Leu Glu Glu Gly Ala Ala Ala Gly Ala Val 
50" 55 60 

Gly Ser Arg Gly Gin Ala Ala Ser Gly Ser Gin Ala Arg Pro Ala Asp 
65 " 70 75 80* 

Val Gin Thr Pro Leu His Pro Gin Ala 
35 



INFORMATION PGR SEQ ID NO: 324: 

(i) SEQUENCE' CHARACTERISTICS: 

(A) LENGTH: 7 6 amino acids 

.{3) TYPE: amino acid 

(C) STRANDEDNESS: 

(0} TOPOLOGY : linear ~ 

(ii) KOLECULE TYPE: peptide 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 

Arg Ala Ala Ala Asp Cys Glu His Giy Leu Arg Arg Gly Ser Gin Asn 

1 5 10 15 

Val Pro Gin Arg Lys Glu Gly Arg Ala Ser His Leu Glu Gly Glu Gly 
20 25 30 

Thr Val Gin Arg Ala Gin' Leu Arg Ala Gly Ala Ala Pro Arg Pro Pro 

35 40 45 

Gly Arg Leu Cys Ala Gly Pro Gly Arg Tyr Pro Gin Gly Leu Ala His 
50 55 , 60 



Leu Arg Ala Ala Cys Ala Giy Pro Gly Pro Aia Aia 
65 70 75 
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INFORMATION FOR SEQ ID MO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(Li) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325: 



Ala Val Leu Cys Gin Gly Gly Cys Asp Gly Arg Val Arg His His Pro 
1 5 -10 15 

Pro Gly Gin Ala His Gly Gly His Arg Gin His His Gin Thr Pro Glu 
20 ■ 25 30 . 

His Val Leu Arg Ala Ser Val Cys Arg Gly Pro Glu Gly Arg Pro Trp 

35 40 45 

Ala Arg Pro Gin Gly Leu Gin Glu Pro Arg Leu Tyr Leu Asp Arg Pro 
50^ 55 60 

=>ro Ala Val His Ala Thr Val Arg Glv Ser Pro Ala Gly Xaa Gin Pro 
65 70 / 75 _ - 30 

Ala Glu Gly Cys Arg Arg His Arg Ala Glu Leu Leu Pro Glu 
35 90 



IN FORMAT I OM FOR SEQ ID MO: 32 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : '63 amino acids 
(2) TYPE: amino acid 
(C) STRANDEDNESS : 
[0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO:326; 

Gly Gin Gin Trp Pro Leu Arg Arg Leu Pro Thr Leu His Val Pro Pro 
I 5 10 15 

Arg Arg Ala His Gin Gly Gin Val Leu Arg Pro Val Pro Gly Asp Pro 
20 25 30 

Ala Gly Leu His Pro Leu His Ala Ala Leu Gin Pro Val Leu Arg Arg 
35 40 45 

His Gly Glu Gin Ala Val Cys Gly Asp Ser Ala Gly Arg Ala Ala Pro 
50* 55 60 



Ala ?he Gly Gly 
65 
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INFORMATION FOR SEQ ID NO : 3 2 7 : 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 23 amino acids 
,{B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:327 



Phe Leu Val Gly Asp Thr Ser Pro His Pro Arg Glu Asn Leu Pro Gin 
1 5 10-15 

Asp Pro Gly Pro Arg Cys Pro 
20 



INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



>;i ) SEQUENCE DESCRIPTION: SEQ ID NO: 323: 

Val Trp Leu Ara Gly Glu- Leu Ala Glu Asp Ser Gly Glu leu Pro Cys 
1 * " ' 5 10 "15 

Arg Arg Arg Gly Pro Gly Trp His Gly Phe Cys Ser Asp Ala Gly Pro 
20 25 30 

Arg Pro He Pro Leu Val Arc Pro Ala Ala Gly Tyr Pro Asp Pro Gly 
35 J 40 45 

Gly Ala Glu Arg Leu Leu Gin Leu Cys Pro Asp Leu His Gin Ser Gin 
50 55 60 

Ser His Leu Gin Pro Arg Leu Gin Gly Trp Glu Glu His Ala Ser Gin 
65 70 75 30 

Thr Leu Trp Gly Leu Ala Ala Glu Val Ser Gin Pro Val Ser Gly Phe 
35 50 95 

Ala Gly Glu Gin Pro Pro Aso Gly Val His Gin His Leu Gin Asp Pro 
100 105 HO 

Pr-o Ala Ala Gly Val Gin Val Ser Arg Met Cys Ala Ala Ala Pro He 
115 120 125 

Ser Ser Ala Ser Leu Glu Glu Pro His He Phe Pro Ala Arg His Leu 
130 135 140 
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INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 137 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS: 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:329: 

His Gly Leu Pro Leu Leu Leu His Pro Glut Ser Gin Glu Arg Arg Asp 
1 5 10 15 

Val Ala Gly Gly Gln'Gly Arg Arg Arg Pro Ser Ala Leu Arg Gly Arg 
20 25 30 

Ala Val Ala Val Pro Pro Ser lie Pro Ala Gin Ala Asp Ser Thr Pro 
35 40 45 

Cys His Leu Arg Ala Thr Pro Gly Val Thr Gin Asp Ser Pro Asp Ala 
50 55 60 

Ala Glu Ser Glu Ala* Pro Gly Asp Asp Ala Asp Cys Pro Gly Gly Arg 
65 70 75 80 

S^r Gin Pro Gly Thr Ala Leu Arg Leu Gin Asp His Pro Gly Leu Met 
85 90 95 

Ala Thr Arg Pro Gin Pro Gly Arc Glu Gin Thr Pro Ala Ala Leu Ser 
100 " 105 110 

Arg Arg Ala Leu Arg Pro Arg Glu Gly Gly Ala Ala His Thr Gin Ala 
LIS 120 125 

Ara Thr Ala Gly Ser Leu Arg Pro Glu 
130 ~ 135 



INFORMATION EOR SEQ 10 MO: 330: 

(i) SEQUENCE: CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(3) TYPE: amino acid 

(C) S T RAM 0 EDM ESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:330: 

Val Phe Gly Arg Gly Leu His Val Arg Leu Lys Ala Glu Cys Pro Ala 
15 10 15 



Glu Ala 
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(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS : 

(A) -LENGTH: 71 amino acids 

(B) TYPE: amino acid 

(C) " STRANDEDMESS : 

. (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 

Ala Ser Val Gin Pro Arg Ala Giu Cys Pro Ala His Leu Pro' Ser Ser 
1 5 10 15 

Leu Pro His Arg Leu Ala Leu Gly Ser Thr "Pro Gly Pro Ala Phe Pro 
20 25 30 

His Gin Giu Pro Gly Phe His Ser Pro His Arg Asn Ser Pro Ser Pro 
- 35 40 45 

Asp Ser Pro Leu Phe Thr Pro Arg Pro Ala Leu Leu Cys Leu Pro Pro 
'■ * * 50 55 60 

Pro Pro Ser Arg Trp Arg ?ro 
- 65 70 



;2) IN FORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 0 amino acids 

{3) TYPE: amino acid.. 

;C) STRANDEDMESS : 

.0) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: peptide 

(x.i) SEQUENCE DESCRIPTION: SEQ ID MO: 3 32: 

Giu Giv Pro Tro Giu Leu Tro Giu Phe Gly Val Thr Lys Gly Val Pro 
1 ' 5 ' 10 . 15 . 

Cys Thr Gin Ala Arg Thr Leu His Leu Asp Gly Gly Pro Cys Gly Ser 
20 25 30 

Asn Tro Gly Giu Val Leu trp Giu 
35 40 



(2) INFORMATION EOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDMESS: 
(0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 333: 

Asn Thr Glu Tyr Met Ser Phe Ser Val Leu Lys Lys Lys Lys Lys Lys 
1 " 5 10 15 

Lys Lys 



(2) INFORMATION FOR SEQ' ID NO:334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



;xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334: 

Ser Ala Ala Ser Cys Cys Ala Arg Gly Lys Pro Trp Pro Arg Pro Pro 
15 10 15 

Pro Arc Cys Ara Ala Lea Pro Ala Ala Glu Pro Cys Ala Pro Cys Cys 
2<T 25"' 30 

Ala Ala Thr Thr Ala Arg Cys Cys Arg Trp Pro Arg Ser Cys Gly Ala 
35 40 ' 45 

Tro Gly Pro Arg Ala Gly Gly Trp Cys Ser Ala Gly Thr Arg Arg Leu 
50 * 55' * 60 

Sor Ala Arg Tro Tro Pro Xaa Ala Tro Cys Ala Cys Pro Gly Xaa Xaa 
65 70 "75 30 

GIv Xaa Pro Pro Pro Pro Pro Pro Ser Ala Arg Cys Pro -Ala 
85 • 90 



(2) INFORMATION FOR SZQ ID NO: 33 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 ammo acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 

Xaa Xaa Trp Tro Pro Glu Cys Cys Xaa Xaa Cys Ala Xaa Ala Ala Arg 
15 10 15 

Xaa Thr Cvs Tro Pro Ser Ala Ser Arg Cys Trp Thr Gly Pro Ala Gly 
20* 25 30 
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Ala Pro Pro Arg Pro Ser Pro Pro Ala Cys Ala Ala Thr Cys Pro Thr 
35 40 45 

Arg 



INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTR: 1003 amino acids 
( 8) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 336: 

Pro Thr His Cys Gly Gly Ala Gly Arg Gly Gly Cys Cys Cys Ala Ala 
15 10 15 

■ Trp Ala Thr Thr Cys Trp Phe Thr Cys Trp His Ala Ala Arg Kaa Leu 
20 25 30 

Cys Trp Trp Xaa Pro Ala Ala Pro ThTr Xaa Cys Ala Gly Arg Arg Cys 
35 40 45 

Thr Ser Ser Ala Leu Xaa Leu Arg Pro Gly Pro Arg His Thr Leu Xaa 
50 55 60 

Asp Pro Xaa Ala Ser Gly lie Gin Arg Ala Trp Asn His Ser Val Arg 
65' .70 75* 80 

Glu Ala Gly Val "Pro Leu Gly Cys Gin Pro Arg Val Arg Gly Gly Ala 
35 90 95 

Gly Ala Val Pro Ala Glu Val Cys Arg Cys Pro Arg Gly Pro Gly Val 
100 105"' 110 

Ala Leu Pre Leu Ser Arg Ser Gly Arg Pro Leu Gly Arg Gly Pro Gly 
115 120 125 

Pro Thr Arg Ala Gly Arg Leu Asp Arg Val Thr Val Val Ser Val Tr_p 
130 135 " 140 

Cys His Leu Pro Asp Pro Pro Lys Lys Pro Pro Leu Trp Arg Val Arg 
145 ' 150 155 160 

Ser Leu Ala Arg Ala Thr Pro Thr His Pro Trp Ala Ala Ser Thr Thr 
165 170 175 

Arg Ala Pro His Pro His Arg Gly His His Val Leu Gly His Ala Leu 
180 135 190 

Ser Pro Gly Val Arg Arg Asp Gin Ala Leu Pro Leu Leu Leu Arg Arg 
195 200 205 



Gin Xaa His Cys Xaa Pro Pro Ser Tyr Ser lie Tyr Leu Arg Pro Ser 
210 * 215 220 
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Leu Thr Gly Val Arg Glu Val Arg Gly Asp Xaa Leu Ser Gly Ser. Arg 
225 230 235 240 

Pro Trp Met Pro Gly Phe Pro Ala Gly Cys Pro Ala Cys Pro Ser Xaa 
245 250 255 

Thr Gly Lys Cys Gly Pro Cys Phe Trp Ser Cys Leu Gly Thr Thr Arg 
260 265 270 

Ser Ala Pro Thr Gly Cys Ser Ser Arg Arg Thr Ala Arg Cys Glu Leu 
1 275 . 230 285 

Arg Ser Pro Gin Gin Pro Val Ser Val Pro Gly Arg Ser Pro Arg Ala 
290 295 300 

Leu Trp Arg Pro Pro Arg Arg Arg Asn Thr Asp Pro Arg Arg Leu Val 
305 _ 310' 315 320 

Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
325 330 335 

Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
340 345 350 

His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys. Phe He 'Ser Leu 
355 " • 360 ' 365 

Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
370 375 380 

Val Arg Asp Cvs Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
335 " * 390 395 400 

Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe Leu 
405 4 10 4 15 

His Tro Leu Met' Ser Val. Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
420 425 430 

Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
435 440 445 

Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly lie Arg Gin His Leu 
450 4 55 4 60 

Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 
465 470 475 480 

Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro 
435 490 495 

Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 
500 505 510 

Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
515 ' 520 - 525 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
530 535 540 
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Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arc Ala 
545 550 555 560 

Trp Arg Thr -Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
565 - 570 575 

Leu Tyr. Phe Vai Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He Pro 
580 585 590 

Gin Asp Arg Leu .Thr Glu Val lie Ala S.er lie lie Lys Pro Gin Asn 
595 600 605 

Thr Tyr Cys Val Arg Arg Tyr "Ala Val Val Gin Lys Ala Ala His Gly 
610 615 620 

His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu 
625 630 635' * 640 

Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Xaa Asn Ser Pro 
645 650 655 

Leu Arg Asp Ala Vai Val lie Glu Gin Ser Ser Ser Leu Asn Glu Ala 
660 665 670 

Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 
675 680 " 685 

Val Arg lie Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly lie Pro Gin. 
690 695 700 

Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met 
705 7L0 7L5 ' * 720 

Glu Asn Lys Leu Phe A.la Gly lie Arg Arg Asp Gly Leu Leu Leu Arg 
7~25 730 ' .735 ■ 

Leu Val Asp Asp Phe Leu Leu Vai Thr Pro His Leu Thr His Ala Lys 
740 745 750 

Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Val 
755 760 765 

Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu Ala 
770 775 730 

Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
735 790 795 300 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
805 810 815 

Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe Asn 
820 325 830 

Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val 
335 " 840 845 



Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp, Leu Gin Val Asn Ser 
850 855 860 
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Leu Gin Thr Val Cys Thr Asn He Tyr Lys He Leu Leu Leu Gin Ala 
865 870 8 75 

Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
y 835 890 , 895 

Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala Ser 
900 905 

Leu Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
915 920 925 

Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
930 935 940 

Cys His Gin Ala Phe Leu Leu Lys Leu Thr. Arg His Arg Val Thr Tyr 

945 . 950 955 

Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg " 

965 970 97d 

Lvs Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
980 985 990 

Ma t=u Pro Ser Asp Phe Lys Thr He Leu Asp 
995 ' 10°° " 



;2) INFORMATION FOR SEQ ID MO:337: 

(i) SEQUENCE CHARACTERISTICS: - 
(A) LENGTH: 38 amino acids 
•3) TYPE: amino acid 
(C) STRANDEDMESS : 
' D ) TOPOLOGY: linear . 

'(Li) MOLECULE TYPE : . pept ide 
(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

Tro Pro Pro Ala His Ser Gin Ala Glu Ser Arg His Gin Gin Pro Cys 

r 5 10 . 

Uis & ia Gly Leu Tyr Val Pro Gly Arg Glu Gly Arg Pro Thr Pro Arg 
- 20 25 30 

Pro Ala Pro Leu Gly Val 
35 



(2) INFORMATION TOR SEQ ID MO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 
(•3) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 
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(xx) SEQUENCE DESCRIPTION: SEQ ID NO:338: 

Gly Leu Ser Glu Cys Leu Ala Glu Ala Cys Met Ser Gly 
1 .5 10 



INFORMATION . FOR SEQ ID NO: 33 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 amino acids 
(S) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE. DESCRIPTION: SEQ ID NO:339: 

Arg Leu Ser Val Arg Leu Arg Pro Glu Arg Val Ser Ser Gin Gly Leu 
1 5 10 15 

Ser Val Gin His Thr Cys Arg Leu His Phe Pro Thr Gly Trp Arg Ser 
20 25 30 



Ala Pro Pro Gin Gly Gin Leu Phe Le-u Thr Arg Ser Pro Ala Ser Thr 
35 40 45 



Pro His lie Gly He Val His Pro Gin He Arg His Cys Ser Pro Leu 
50 . 55 60 



Ala Leu Pro Ser Phe Ala Phe His 
65 70 

Glu Lys Asp Pro Gly -Ser Ser Gly 
85 



INFORMATION FOR SEQ ID MO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



Pro His Kis Pro Gly Gly Asp Pro 
75 80 

Asa Leu Glu 
90 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

Pro Lys Val Cys Pro Val His Arg Arg- Gly Pro Cys Thr Trp Met Gly 
15 10 15 

Val Pro Val Gly Gin He Gly Gly Arg Cys Cys Gly Ser Lys He Leu 
20 25 30 



Asn He 
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(2) INFORMATION FOR SEQ ID MO: 34 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 1: 

Val Phe Gin Phe 
1 



(2) INFORMATION FOR SEQ ID NO: 34 2: 

*"(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 amino acids 
^ (8) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

Lys Lys Lys Lys Lys Lys Lys Lys " 
1 5 



(2) INFORMATION FOR SEQ ID NO: 34 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4037 base -pairs 
(3) TYPE: nucleic acid 
(C)_ STRANDEDNESS: single 
<D)~ TOPOLOGY: linear' 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/ KEY : CDS 

(B) LOCATION: 56 . . 3454 

(D) OTHER INFORMATION: /note- "refined sequence of hTRT cDNA" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC CCGCG ATG 

Met 



CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC CAC . 10 6 

Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 
5 10 15 ■ 
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TAG CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG GGG CCC 15 4 

Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe' Val Arg Arg Leu Gly Pro 
20 25 30 

CAG GGC .TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT TTC CGC GCG 20 2 

Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 
35 40 45 

CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC GCA CGG CCG CCC 250 
L eu val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 65 

CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC CTG AAG GAG CTG GTG 298 
Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 

GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC GGC GCG AAG AAC GTG CTG 34 6 

ALa Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 
85 90 95 

GCC TTC GGC TTC GCG CTG CTG GAC GGG GCC CGC GGG GGC CCC CCC GAG 394 
Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 110 

GCC TTC ACC ACC AGC GTG CGC AGC TAC CTG CCC AAC ACG GTG ACC GAC 442 
Ala Phe Thr Thr Ser Val Arg Ser Tyr Lev, Pro Asn Thr Val Thr Asp 
115 - 120 125 

GCA CTG CGG GGG AGC GGG GCG TGG GGG CTG CTG CTG CGC CGC GTG GGC 4 90 

Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 
130 135 140 145 

GAC GAC GTG CTG GTT CAC CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG 5 38 

Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 " 155 ^60 

GTG GCT CCC AGC TGC. GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG 58 6 

Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 
165 ' HO ~ 175 

CTC GGC GCT GCC ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC 634 
Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 
130 185 . 190 

CGA AGG CGT CTG GGA TGC GAA CGG GCC TGG AAC . CAT AGC GTC AGG GAG 68 2 

Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 
195 200 205 

GCC GGG GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG 7 30 

Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210 215 220 225 

GGC AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 7 78 

Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly 
230 235 240 

GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG GCC 8 26 

Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 
245 250 255 
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CAC 
His 



CCG 
Pro 



GGC AGG ACG 
Gly Arg Thr 
260 



CGT GGA 
Arg Gly 



CCG 
Pro 
265 



AGT 
Ser 



GAC 
Asp 



CGT GGT 
Arg Gly 



TTC 
Phe 
210 



TGT GTG GTG 
Cys Val Val 



874 



TCA 
Ser 



CCT 
Pro 
275 



GCC AGA CCC 
Ala Arg Pro 



GCC GAA 
Ala Glu 
280 



GAA 
Glu 



GCC 
Ala 



ACC 
Thr 



TCT TTG 
Ser Leu 
285 



GAG GGT GCG CTC 
Glu Gly Ala Leu 



922 



TCT 
Ser 
290 



GGC ACG CGC CAC 
Gly Thr Arg His 



TCC CAC 
Ser His 
295 



CCA 
Pro 



TCC 
Ser 



GTG 
Val 



GGC CGC 
Gly Arg 
300 



CAG 
Gin 



CAC CAC GCG 
His His Ala 
305 



970 



GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC TGG GAC ACG CCT TGT 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 



1018 



CCC CCG GTG TAC GCC GAG ACC AAG CAC 
Pro Pro Val Tyr Ala Glu Thr Lys His 
325 330 



TTC CTC TAC TCC 
Phe Leu Tyr Ser 



TCA GGC GAC 
Ser Gly Asp 
335 
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PAG GAG CAG CTG CGG CCC TCC TTC CTA CTC AGC TCT CTG 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu 
340 345 . 350 



AGG CCC AGC 
Arg Pro Ser 



1114 



CTG 
Leu 



ACT. 
Thr 
355 



GGC GCT CGG 
Gly Ala Arg 



AGG CTC 
Arg Leu 
360 



GTG 
Val 



GAG 
Glu 



ACC 
Thr 



ATC TTT 
He Phe 
365 



CTG GGT TCC AGG 
Leu Gly Ser Arg 
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CCC 
Pro 
370 



TGG 



ATG CCA GGG 
Met Pro Gly 



ACT CCC 
Thr Pro 
375 



CGC 
Arg 



AGG 
Arg 



TTG 
Leu 



CCC CGC 
Pro Arg 
380 



CTG 
Leu 



CCC CAG CGC 
Pro Gin Arg 
385 
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TAC TGG CAA ATG CGG CCC CTG 
Tyr Trp Gin Met Arg Pro Leu 
390 



TTT 
Phe 



CTG 
Leu 



GAG 
Glu 
395 



CTG CTT 
Leu Leu 



GGG 
Gly 



AAC CAC GCG 
Asa His Ala 

400 



1253 



CAG 
Gin 



GC 



CCC TAC GGG 
Pro Tyr Gly 
405 



GTG CTC 
Val Leu 



CTC 



AAG 
Lys 
4 10 



ACG CAC TGC CCG 
Thr His Cys Pro 



CTG CGA GCT 
Leu Arg Ala- 
415 



1306 



GCG 
Ala 



GTC 
Val 



ACC CCA GCA 
Thr Pro Ala 
420 



GCC GGT 
Ala Gly 



GTC 
Val 
425 



TGT 
Cys 



GCC CGG 
Ala Arg 



GAG AAG 
Glu Lys 
430 



CCC CAG GGC 
Pro Gin Gly 



1354 



TCT 
Ser 



GTG 
Val 
435 



GCG GCC CCC 
Ala Ala Pro 



GAG GAG 
Glu Glu 
440 



GAG 
Glu 



GAC 
Asp 



ACA GAC 
Thr Asp 



CCC CGT 
Pro Arg 
44 5 



CGC CTG GTG 
Arg Leu Val 



1402 



CAG 
Gin 
450 



CTG 
Leu 



CTC CGC CAG 
Leu Arg Gin 



CAC AGC 
His Ser 
455 



AGC 
Ser 



CCC 
Pro 



TGG CAG 
Trp Gin 
460 



GTG TAC 
Val Tyr 



GGC TTC GTG 
Gly Phe Val 
465 



1450 



CGG GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 
Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 



14 98 



CAC 
His 



AAC 
Asn 



GAA CGC CGC 
Glu Arg Arg 
485 



TTC CTC 
Phe Leu 



AGG 
Arg 



AAC 
Asn 
490 



ACC AAG 
Thr Lys 



AAG TTC 
Lys Phe 



ATC TCC CTG 
He Ser Leu 
495 



1546 
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GGG AAG CAT GCC AAG - CTC TCG CTG CAG GAG CTG ACG TGG AAG ATG AGC '15 94 

Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
500 505. 510 

GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT GGC TGT GTT 164 2 

Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 
515 520 . 525 

CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG GCC AAG TTC CTG 16 90 

Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe Leu 
530 535 540 545 

CAC TGG CTG ATG ACT GTG TAG GTC GTC GAG CTG CTC AGG TCT TTC TTT 17 38 

His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

TAT GTC ACG GAG ACC ACG TTT CAA" AAG AAC AGG CTC TTT TTC TAC CGG 17 8 6 

Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg . . 

565 570 575 

CCG AGT GTC TGG AGC AAG TTG CAA AGC ATT GGA ATC AG A CAG CAC TTG 18 3 4 

Pro Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His Leu 
580 585 590 

AAG AGG GTG CAG CTG CGG GAG CTG TCG GAA GCA GAG GTC AGG CAG CAT 183 2 

Lys Arg Val Gin Leu Arg Glu Leu Ser Glu- Ala Glu Val Arg Gin His 
595 _ 600 605 

CGG GAA GCC AGG CCC GCC CTG CTG ACG TCC AG A CTC CGC TTC ATC CCC 193 0 

Ar: Glu Ala Arg Pro Ala Leu Leu. Thr Ser Arg Leu Arg Phe lie Pro 
610 615 620 . 625 

AAG CCT GAC GGG CTG CGG CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA 197 8 

Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val Gly 
630 - 635 " 6AQ 

■j:: AGA ACG TTC CGC AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AG 3 20 2 6 

Ali Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Ar; 

6 4 5 650 - 6 5 5 

GTG AAG GCA CTG TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC ■ 2074' 

Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
660 665 670 

GGC CTC CTG GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC 2122 

Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg Ala 
675 680 685 

TGG CGC ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG 2170 

Tro Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
690 695 700 705 

CTG TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ' ACC ATC CCC 2218 

Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He Pro 
710 715 720 

CAG GAC AGG ■ CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 2 2 66 

Gin Asp Arg Leu Thr Glu Val lie Ala Ser He lie Lys Pro Gin Asn 

725 730 735 
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ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC CAT GGG 2 314 

Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 

CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG AC A GAC CTC 2 362 

His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp Leu 
755 760 765 

CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG GAG ACC AGC CCG 2 410 

Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser Pro 
770 775 780 785 

CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC TCC CTG AAT GAG GCC 2 4 58 

Leu Arg Asp Ala Val Val lie Glu Gin Ser Ser Ser Leu Asn Glu Ala 
790 795 800 

AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC TTC ATG TGC CAC CAC GCC 2 50 6 

Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His Ala 
305 810 815 

GTG CGC ATC AGG GGC AAG TCC TAC GTC CAG TGC CAG GGG ATC CCG CAG 2 55 4 

Val Arg lie Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro Gin 
820 825 830 

GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC CTG TGC TAC GGC GAC ATG - 2 602- 
GLy Ser tie Leu Ser Thr Leu Leu Cys Ser- Leu Cys Tyr Gly Asp Met 
'3 35 84 0 8 45 

GAG A.AC AAG CTG TTT GCG GGG ATT CGG CGG GAC GGG CTG CTC CTG CGT" 2 650 

Glu Asn Lys Leu Phe Ala Gly lie Arg Arg Asp Gly Leu Leu Leu Arg 
350 855 360 - 365 

TTG GTG GAT GAT TTC TTG TTG GTG AC A CCT CAC CTC ACC CAC GCG AAA 2698 
Leu Val Asc Aso Phe Leu Leu Val Thr Pro His Leu Thr His Ala Lys 
870 375 330 

ACC TTC CTC AGG ACC CTG GTC CGA GCT GTC CCT GAG TAT GGC TGC GTG 27 46 

Thr ?he L-rj Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Giy -Cys Val 
3 35 - 3 90 3 95 

CTG A.AC TTG CGG AAG AC A GTG GTG AAC~ TTC CCT GTA GAA GAC GAG GCC 27 94 

Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Vai Glu Asp Glu Ala 
90C 905 910 

CTG GGT GGC ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC 23 4 2 

Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro 
915 920 925 

TGG TGC GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC 28 90 

Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp 
930 J 935 ' 940 945 

TAC TCC AGC TAT GCC CGG ACC TCC ATC AG A GCC AGT GTC ACC TTC AAC 2 9 38 

Tvr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Val Thr Phe Asn 
950 955 . 960 

CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC 2 93 6 

Arg Glv Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val 
965 970 975 
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TIG CGG CTG AAG TGT CAG AGC CTG TTT CTG GAT TTG GAG GTG AAC AGC 3 03 4 

Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
980 935 990 

GTG CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG GCG 303 2 

Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin Ala 
995 1000 ■ 1005 

T AC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG CAA. GTT 3130 
Tyr Arg Phe His Ala Cys Val. Leu Gin Leu Pro Phe His Gin Gin Val 
1010 1015 1020 1025 

TGG AAG AAC CCC AC A TTT TTC CTG CGC GTC'ATC TCT GAC ACG GCC TCC 3173 
Tro Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser Asp Thr Ala Ser 
1030 1035 1040 

CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC GCA GGG ATG TCG' CTG GGG 3226 
Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
1045 1050 1055 

GCC AAG GGC GCC GCC GGC CCT CTG CCC TCC GAG GCC GTG CAG TGG CTG 3 27 4 

Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
1060 1065 1070 

T3C~CACCAA GCA TTC CTG CTC. AAG CTG ACT CGA CAC CGT GTC ACC TAC 3322 
Cys Kis Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 
1075 - 1030 " . 1085 

GTG CCA CTC CTG GGG TCA CTC AGG AC A GCC. CAG ACG CAG CTG ACT CGG 3 3 70 

Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arc 
1030 1095 1100 ' 1105 

A A G CTC CGG GGG ACG ACG CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG 34 18 

Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
1110 - 1115 1120 

"A CTG CCC TCA GAC TTC AAG ACC ATC CTG GAC TGATGGCCAC CCGCCCACAG 34 71 

Ala Leu E?ro Ser Asp Phe Lys Thr lie Leu Asp 
1125 * 1130 

GCACCCCGAG AGCAGACACC AGCAGCCCTG TCACGCCGGG CTCTACGTCC CAGGGAGGGA ' 35 31 

GGGGCGGCCC ACACCCAGGC CCGCACCGCT GGGAGTCTGA GGCCTGAGTG AGTGTTTGGC 35 91 

CGAGGCCTGC ATGTCCGGCT GAAGGCTGAG TGTCCGGCTG AGGCCTGAGC GAGTGTCCAG 35 51 

CC AAG GG CTG AGTGTCCAGC ACACCTGCCG TCTTCACTTC CCCACAGGCT GGCGCTCGGC 37 11 

TCCACCCCAG GGCCAGCTTT TCYTCACCAG GAGCCCGGCT TCCACTCCCC ACATAGGAAT 37 7 L 

AGTCCATCCC CAGATTCGCC ATTGTTCACC CYTCGCCCTG CCYTCCTTTG CCTTCCACCC 38 31 

CCACCATCCA GGTGGAGACC CTGAGAAGGA CCCTGGGAGC TCTGGGAATT TGGAGTGACC 33 91 

AAAGGTGTGC CCTGTACACA GGCGAGGACC CTGCACCTGG ATGGGGGTCC CTGTGGGTCA 3951 

AATTGGGGGG AGGTGCTGTG GGAGTAAAAT ACCGAATATA TGAGTTTTTC AGTTTTGPAA 4011 

AAAAAAAAAA AAAAAAAAAA AAAAAA 4 0 37 
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(2) INFORMATION FOR SEQ ID NO: 344: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

iy»t Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
'\ 5 10 15 

His t V v A rc Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu.Gly 
20 25 30 

=>r- 0 Gin Glv Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 4 0 4 5 

:i a L o U Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 



! :o Pro 
6 5 



Ai^ Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
' " ' 70 75 80 



,.^ L „ la - n_ Vai Leu GIn Arg Leu Cys Glu Arg Gly Ala Lys Asa Val 
85 90 95 

--u " ,! a Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 10 5 - HO 

Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 125 

l so --g Glv Ser Gly Ala Tro Gly Leu Leu. Leu Arg Arg Val 

V58 ^ ' 135 1^0 

- : so Asc Val Leu Val His Leu~ Leu Ala Arg Cys Ala Leu Phe Vai 
■ 150 _ 155 1^0 

a \ * ^ro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 HO 175 . 

r.in Giv Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 

~~ ' " 130 135 190 

= -0 Arg Arc Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
19 5 200 205 

Giu Ai a Glv Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 " 215 220 

Glv GW S-- Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 
225 ~" ~ 230 235 240 

riv ai a *1» a ro Glu Pro Glu Arg Thr Pro Val Gly GIn Gly Ser Trp 
* 245 250 255 

aia K<s P-o Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys. Val 
260 265 270 
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Val Ser Pro Ala 
275 

Leu Ser Gly Thr 
- 290 

A.la Gly Pro Pro 
305 

Cys Pro Pro Val 



Asp Lys Glu Gin 
340 

Ser Leu Thr Gly 
355 

Arg Pro Trp Met 
370 

Arc Tyr Trp Gin 
335 

Ala Gin Cys Pro 



Ala Ala Val Thr 
420 

Gly Ser Val Ala 

Yal Gin Leu Leu 
450 

7-5 1" Arg Ala Cys 



■^z Hi? Asn Glu 



Leu Gly Lys His 
500 

Ser Val Arg As? 
515 

Val Pro Aia Ala 
530 

Leu His Trp Leu 

C ,1 - 
■J ~J _/ 

Phe Tyr Val Thr 



Arg Pro Ser Val 
580 



Arg Pro Ala Glu 
230 

Arg His Ser Kis 
295 

Ser Thr Ser Arg 
310 

Tyr Ala Glu Thr 
325 

Leu Arg Pro Ser 



Ala Arg Arg Leu 
360 

Pro Gly Thr Pro 
375 

Met Arg Pro Leu 
■ 390 

Tyr Gly Val Leu 
405 

Pro Ala Ala Gly 



Ala Pro Glu Glu 

4 40 

Arg Gin His Ser 
4 55 

Leu Arg Arg Leu 
4 7 0 

Arg Arg Phe Leu 
435 

Ala Lys Leu Ser 



Cys Ala Trp Leu 
520 

Glu His Arg Leu 
535 

Met Ser Val Tyr 
550 

Glu Thr Thr Phe 
565 

Trp Ser Lys Leu 



Glu Ala Thr Ser 



Pro Ser Val Gly 
300 

Pro Pro Arg Pro 
315 

Lys His Phe Leu 
330 

Phe Leu Leu Ser 
345 

Val GTu Thr lie 



Arg Arg Leu Pro 
330 

Phe Leu Glu Leu 
395 

Leu Lys Thr His 

4 1 Gr 

Val Cys Ala Arg 
425 

Glu As? Thr Asp 



Ser Pro Trp Gin 
4 60 

Va 1 . ? ro P ro Gly 
4 7 5 

Arg Asn Thr Lys 
4 90 

Leu Gin Glu Leu 
505 

Arg Arg Ser Pro 



Arg Glu Glu He 
540 

Val Val Glu Leu 
555 

Gin Lys Asn Arg 
570 

Gin Ser lie Gly 
535 



Leu Glu Gly Ala 
285 

Arg Gin His His 



Trp Asp Thr Pro 
320 

Tyr Ser Ser Gly 
335 

Ser Leu Arg Pro 
350 

Phe Leu Gly Se r 
365 

Arg Leu Pro Gin 



Leu Gly "Asn His 
400 

Cys Pro Leu Arg 
415 

Glu Lys Pro Gin 
430 

Pro Arg Arg Leu 
4 4 5 

Val Tyr Gly Phe 



Leu Trp Gly Ser 

4 3 0 

Lys Phe lie Ser 
495 

Thr Trp Lys Met 
510 

Gly Val Gly Cys 
525 

Leu Aia Lys Phe 



Leu Arg Ser Phe 
560 

Leu Phe Phe Tyr 
575 

lie Arg Gin His 
590 
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W Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 600 605 

H.s Glu Ala Arg Pro Ala Leu Leu- Thr Ser Arg Leu Arg Phe lis 

610 615 620 

?ro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

trq Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

or Q G iy Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg 
6 75 680 _ " 685 

-'a T-rp Arc Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 ' 695 7 00 

<~i,. r« u Tv- Val Lys Val Aso Val Thr Gly Ala Tyr Asp Thr lie 

7oi '■' " ' 710 715 720 

-o Gin Aso Arg Leu Thr Glu VaL He Ala Ser lie lie Lys Pro Gin 
725 730- " 733 



.„ ~„ r t..- -vs Val arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 

* 740 745 7 50 

---is va" Arg Lvs Ala Phe Lys Ser His Val Ser- Tar Leu Thr Asp 

755 ' 760 765 

--■■ Gin **o Ty- hoc Arg Gin. Phe Val Ala His Leu Gin Glu Thr Ser 

770 "* 775 730 



■z Aso Ala Val Val lie Glu Gin Ser Ser Ser Leu Asn Glu 
7 50 



795 800 



•'- S — S-* Giy Leu Phe Aso Val Phe Leu Arg Phe Met Cys His Kis 

" " 805 - 810 315 

zi* Val He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro 

' 320 325 830 



Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
83 5 840 • 845 

y-t Glu Asn. Lys Leu Phe Ala Gly He Arg Arg Asp Giy Leu Leu Leu 
850 355 860 

Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 
o« 870 375 880 

KJ '.J -J 

r v3 Thr PT.e Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys 
835 890 895 

Va i Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu 
900 905 910 
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Ala Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Vai Gin Ser 
530 935 - 940 

Aso Tyr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Val Thr Phe 
94*5 950 955 - 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
965 970 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
930 935 990 

Ser Leu Gin Thr Val Cys Thr Asn lie Tyr Lys Ile_ Leu Leu Leu Gin 
995 - 1000 ' 1005 

Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 1015 1020 

V^'^T^-p Lvs Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp- Thr Ala 
1025 ' ' 1030 . 1035 1040 

Ser Leu Cvs Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 
1045 1030 1055 

. C-lv Ala Lvs Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Vai Gin Trp 
1060 1065 1070 

Leu Cvs His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 
1075 1030 1085 

" - Val Leu Leu Glv Ser Leu Arg Thr'Ala Gin Thr Gin Leu Ser 

-Jy}""^ " 1095 - 1100 

■ Lvs Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn 

■ ; ; 5 11 10 HI 5 1120 

Ala Leu Pro Ser Aso Phe Lys Thr lie Leu Asp 
1125 * U30 



INFORMATION TOR SEQ ID MO: 34 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE; amino acid 

(C) ST RAND EON ESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) EE AT U RE: 

(A) NAME /KEY : Modi f ied-si te 

i^i ... . 

{U} /no r - e = "Xaa - .hydrophooj. c amino acid 

A l*, Le U/ rie, Val, Pro/ Ph8/ Tro 
0 t Met" 



231 



(Ik) FEATURE: 

(A) NAME/KEY: Modi f ied- s i t e 
(S) LOCATION: 5 

(D) OTHER INFORMATION : /product* "OTHER" 

/note» "Xaa - Phe or Tyr" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:345: 

Xaa Xaa Asp Asp Xaa 
1 5 



(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 
{ A) LENGTH: 5 amino acids 
(8) TYPE: amino acid 
(C) STRANDEDNESS : 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i;<) FEATURE: 

(A) NAME/ KEY: Mod i f ied- s i tie - 
(3) LOCATION: 1 

( 0) OTHER INFORMATION: /products "OTHER" 

/note= "Xaa = Phe or Tyr" 

(ix) FEATURE: 

(A) NAME/ KEY : Modi £ ied- s i t e 
(3) LOCATION: 5 

(0) OTHER INFORMATION: /product- "OTHER" 

/ no = "Xaa = hydrophobic araino acid 
Ala, Leu, -He, Vai, Pro, Phe, Tr? 
or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ IO_NO:346: 

Xaa Xaa Asp Asp Xaa 
1 ' 5 



(2) INFORMATION FOR SEQ ID NO : 3 <i 7 : 

(i) SEQUENCE CHARACTERISTICS : ' 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

Trp Xaa Gly Xaa Ser Xaa 
i 5 
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(2) INFORMATION FOR SEQ ID MO: 34 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES3 : 

(D) TOPOLOGY: .linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME /KEY: Modi f ied-si te 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- '-Xaa * hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /product- "OTHER" 

/note« "Xaa. « hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Mer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 
Xaa Leu Gly Xaa Xaa Xaa 



INFORMATION FOR SEQ 10 MO: 349- 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 a m i no " ac ids 
(3) TYPE: artun.o acid 

(C) STRAMDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) HAME/KEY: Modi f ied-si te 
(3) LOCATION: 7 . . 8 

CD) OTHER INFORMATION: /product^ "OTHER" 

/note^ "Xaa - hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

(ix) FEATURE: 

(A) NAME/KEY: Modi fled- s i te 
(3) LOCATION: 10 - .11 

{ D) OTHER , INFORMATION : /product- "OTHER" 

/note= "Xaa - hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 
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(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 
(3) LOCATION: 19 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asa or Gin" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 
(3) LOCATION: 2 3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 4 9 : 

Tro Leu Xaa Tyr Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Thr 
1 5 ~ 10 ' ' 15 

Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Arg Lys Xaa Xaa 
20 25 30 

Tro 



INFORMATION FOR SEQ ID MO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 4 amino acids 
'3; TYPE: arnir.o acid 
■;C) STRAMOEDNESS : 
[0) TOPOLOGY: linear 



ii) MOLECULE TYPE: peptide 



{ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s i te 
(3) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = Leu or lie" 

(ix) FEATURE: 

(A) NAME / KEY : Modi f ied-s i te 
(3) LOCATION: 7 . .3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa - hydrophobic amino acid/ 
Ala, Leu, lie, Val, Pro, Phe, Tro 
or Met" 
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(ix) FEATURE : 

(A) NAME /KEY : Modi f i ed- s i te 

(B) LOCATION: 10 . . 11 

(D) OTHER INFORMATION : /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, Trp 
or Met" 



(ix) FEATURE: 

(A) NAME/ KEY: Modi f ied-si te 

(B) LOCATION: 12 

(0) OTHER INFORMATION: /product- "OTHER" 

./note- "Xaa - Gin o.r Arg" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 
(3) LOCATION: 13 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 

(ix) FEATURE ; 

(A) NAME /KEY: Modi f ied-s ite 
(3) LOCATION: 21 
" (D). OTHER INFORMATION: /product- "OTHER" 

/ note - "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-s ite . 
(3) LOCATION: 2 5 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa « polar amino acic, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 

(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-s ite 
(3) LOCATION : 23 . . 29 

(0) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa = Phe or Tyr" 

(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-s i t e 

(B) LOCATION: 31 

(D) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa = Lys or His" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:' 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Tyr 
.1 5 10 ■ lz 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa 
20 25 30 

Xaa Trp 
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INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-s i te 

(B) LOCATION: 2 

(D) OTHER INFORMATION: /product- "OTHER 11 

/note= "Xaa = Leu or ILe M 

(i;<) FEATURE: 

(A) NAME /KEY : Modif ied-si te 

(B) LOCATION: 7.. 8 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note- "Xaa ~ hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 
or Met" 

;ix) FEATURE: 

(A) NAM'S / KEY : Modi f ied-s ite 
(3) LOCATION : 10 . . II 

<D) OTHER INFORMATION: /product-* "OTHER" 

/note= "Xaa ~ hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, Trp 



■ A) NAME/KEY: Mod i £ ied-s i - e 
;3) LOCATION: 12 

{ 0 ) OTHER INFORMATION: /product- "OTHER" 

/nc:^ "Xaa = Gin or Arg" 



(A) NAME/KEY: Modif ied-s ite 
(3} LOCATION; 13 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 



(A) NAME/ KEY : Modi f ied-s it e 
(3) LOCATION: 21 

(D) OTHER INFORMATION: /prcduct= "OTHER" 

/note- "Xaa - polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 

(i:<) FEATURE: 

(A) NAME/ KEY : Modi f ied-s its 

(B) LOCATION: 25 

(D) OTHER INFORMATION: /prcduct= "OTHER" 

/note* "Xaa = polar amino acid, Gly, 
Ser, Thr, Tyr, Cys, Asn or Gin" 
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(ix) FEATURE: 

(A) NAME /KEY : Modi f i ed- s i te 

(B) LOCATION: 29. .30 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa.= Phe or Tyr' 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-site 

(B) LOCATION: 3 2 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = Lys or His' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35L: 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Tyr 
1 5 10 15 

Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa 
20 25 30 

Xaa Xaa Trp - 
35* 



INFO P.MAT ION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNES5 : 

CO) TOPOLOGY: linear 

fii) MOLECULE TYPE: peptide 



{ xi ) - SEQUENCE DESCRIPTION: SEQ ID NO:352: 

Tro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa- Phe Phe Tyr 

1 .5 10 15 

Xaa Thr Giu Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa 

20 25 30 

Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa He 
3 5 4 0 



(2) INFORMATION FOR SEQ ID NO: 35 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Phe Tyr 
1 5 10 15 



Xaa Thr Glu Xaa Xaa Xaa Xaa Xaa 
20 

Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa 
35 40 



(2} INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) 



(xi) 

Glu Xaa Xaa Val Xaa 
L - 5 



(2) INFORMATION FOR SEQ ID MO: 3 55: 

(i) . SEQUENCE CHARACTERISTICS : 

(A) LENGTH: LI amino acids 
( S ) TYPE: amino acid 
■C) ST RAN DE DM ESS : 
(D) TOPOLOGY : Linear 



Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa 
25 30 

Xaa Xaa lie 



MOLECULE TYPE: peptide 

SEQUENCE DESCRIPTION: SEQ ID NO: 354: 



(ii) MOLECULE TYPE: peptide 



(y.i) SEQUENCE DESCRIPTION: SEQ ID MO: 355: 

Xaa Xaa Xaa Arg Xaa Xaa Pro Lys Xaa Xaa Xaa 
I _ 5 10 



INFORMATION FOR SEQ ID NO: 35 6: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 356: 
Xaa Arg Xaa lie Xaa 
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INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357: 

Xaa Xaa Xaa Xaa Phe Xaa- Xaa Xaa Asp Xaa Xaa Xaa Xaa Tyr Asp Xaa 
1 5 10 15* 



SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 amino acids 
■ (B) TYPE : amino acid 

(C) STRANDEDMESS : 

(D) TOPOLOGY : linear 

MOLECULE TYPE: peptide- 



SEQUENCE DESCRIPTION: SEQ ID NO: 353: 



Xaa 

(2) INFORMATION FOR SEQ ID NO: 358: 
(i) 

(ii) 
(xi) 

Tyr Xaa Xaa Xaa Xaa Gly Xaa Xaa 
5 

Xaa Xaa Xaa Xaa Xaa Xaa 
20 



Gin Gly X a a Xaa X a a S e r Xaa Xaa 
lo' _ 15 



INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDMESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 



Xaa Xaa Xaa Xaa Xaa Xaa Asp Asp Xaa Lea Xaa Xaa Xaa 
1 5 10 
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(2) INFORMATION FOR SEQ ID MO: 360: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

Phe Phe Tyr Xaa Thr Glu 
1.5 



(2) INFORMATION FOR SEQ ID NO: 361: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
. (D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:361: 

Phe Phe Tvr Val Thr Glu 
i * 5 



12* INFORMATION FOR SEQ ID MO: 362: 

(i) SEQUENCE CHARACTERISTICS: 

;a; LENGTH : 17 base pairs 

(3) TYPE: nucleic acid 

.(C) STRANDEDNESS: single 

(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(y.i) SEQUENCE DESCRIPTION: SEQ ID MO:362: 

17 

TTYTTYTAYG TNACNGA 

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 
TCMGTNACRT ARAARAA 



(2) INFORMATION FOR SEQ ID NO: 364: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6. amino acids 
( 3 ) TYPE: amino acid 

(C) STRANDEDNESS : 

( D) TOPOLOGY: .linear 

(ii) MOLECULE TYPE: peptide 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

Arg Phe lie Pro Lys Pro 
1 5 



(2) INFORMATION FOR SEQ ID NO: 365: 

(i) .SEQUENCE" CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ON A 



<:.;i) SEQUENCE DESCRIPTION: SEQ ID NO: 365 



(2) INFORMATION FOR SEQ ID NO: 3 66: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
(2) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36 
GG YTTNGGDA TRAAMC 



;2) INFORMATION FOR SEQ ID NO:367: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5. amino acids 
(3) TYPE : amino acid 

(C) STRANDEDNESS: 

(D) 'TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 

Ala Tyr Asp Thr lie 

1 5 



(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Mi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 
GCNTAYGAYA CM AT 



IN FORMATION FOR SEQ ID NO: 3 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAND EDM ESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



SEQUENCE DESCRIPTION:" SEQ ID NO: 369 
TANG7RTCRT ANGC 



INFORMATION FOR SEQ ID NO: 370: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 



Gly He Pro Gin Gly 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 37 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 'base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 
GGMATHCCNC ARGG 



(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 
.(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(x.i) SEQUENCE DESCRIPTION: SEQ ID MO:372 
SWMCCYTGNG GDATNCC 



INFORMATION FOR SEQ ID NO: 37 3: 

(i) SEQUENCE CHARACTERISTICS: 
{A; LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
(0) -TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 373 

Leu Val Asp Asp Phe Leu 
1 ' 5 



(2) INFORMATION FOR SEQ ID NO: 37 4: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 
YTNGTNGAYG AYTTYYT 



(2) INFORMATION FOR SEQ ID NO: 37 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

Asp Aso Phe Leu Leu Val Thr 
1 ^ 5 



(2) INFORMATION FOR SEQ ID MO: 37 6: 

( 1 ) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nuclei-c acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



{:<!} sequence description: seq id mo: 37 6: 

jt::acnari;a rraartcrtc 



INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
■(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:0 FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /note= "TCP1- 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 377: 
GTGAAGGCAC TGTTCAGCG 
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(2) INFORMATION FOR SEQ ID NO: 378: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) ' STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME/ KEY: - 

(B) LOCATION : 1..20' 

(D) OTHER INFORMATION: /note= ,f TCP1.15 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37 8 : 
CGCGTGGGTG AGGTGAGGTG 



(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DMA 



:) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . .'21 

(Dl OTHER INFORMATION: /not:e= "TCP1.14 prirrv 



SEQUENCE DESCRIPTION: SEQ ID MO:379: 
TGCTGGG CCTGGACGAT A 



(2) INFORMATION FOR SEQ ID NO: 33 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(A) NAME/ KEY: - 

(B) LOCATION : 1 . . 24 

(D) OTHER INFORMATION: /note- "billTCPG primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:380: 
AGCTTGTTCT CCATGTCGCC GTAG 



(2) INFORMATION FOR SEQ ID NO:381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Ik) FEATURE: 

(A) NAME/ KEY : - 

{ B ) LOCATION: 1 . . 19 

(D) OTHER .INFORMATION: / note= " TCP 1 . 2 primer" 



{xi). SEQUENCE DESCRIPTION: SEQ ID NO: 381: 
G T GG AT G ATT TCTTGTTGG 



i2) INFORMATION FOR SEQ ID NO: 33 2: 

(!) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
' (3) TYPE: nucleic acid 

(C) S T RAN D E DN E S S : single 

(D) TOPOLOGY: linear 

:ii) MOLECULE TYPE: DNA 



{!:■:) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 19 

(D) OTHER INFORMATION: /note- "TCP1.4 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 
CTC-GACACTC AGCCCTTGG 



(2) INFORMATION FOR SEQ ID MO: 333: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1, . 19 

(D) OTHER INFORMATION: /note- "TCPL - 5 primer" 

Cxi) SEQUENCE DESCRIPTION; SEQ ID NO: 383: 
GGCAGGTGTG CTGGACACT 19 



(2) INFORMATION FOR SEQ ID. NO : 38 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



<) FEATURE: 

(A) NAME/KEY: - 

( B )" LOCATION : .1 . . 1 9 

CD) OTHER INFORMATION: /note 3 " ,f TCPl . 6 primer" 

(:<:i) SEQUENCE DESCRIPTION: SEQ ID MO: 38 4: 
rGATGATG CTGGCGATG 19 



INFORMATION FOR SEQ ID NO: 335: 

(i; SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
(C) • STRAMDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /note= "TCP1.7 primer" 



{>:!} SEQUENCE DESCRIPTION: SEQ 10 NO: 335: 
GGGGCTCGTC TTCTACAGG 
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(2) INFORMATION FOR SEQ I D NO: 38 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

• (ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

( B ) LOCATION: 1 . . 19 

(0) OTHER INFORMATION: /note= "TCP1.8 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 386: 
CAGCAGGAGG ATCTTGTAG 



(2) INFORMATION FOR SEQ ID NO: 337; 

(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 19 base, pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DNA 



FEATURE : 

'■A) NAME / KEY : - 

(3) LOCATION: 1 . . 19 

(0) OTHE-R INFORMATION: /note* "TCP1.9 primer 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 387: 
TGACCCCAGG AGTGGCACG 



(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

( B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /nots= "TCP1 . 10 prime 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 
rCAAGCTGAC TCGACACCG 19 

(2} INFORMATION FOR SEQ ID NO: 389: ■ 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY: ■ linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

( B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note- "TCPl.ll primer" 



(:<i) sequence description: seq idno:389: 
;ggcgtgaca ggggtgc 17 

;2j information for seq id mo: 390: 
(i) sequence characteristics: 

(A) LENGTH: 13 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(0) TOPOLOGY : linear 

(ii) MOLECULE: TYPE: DMA 



;;•:) FEATURE : 

(A) NAME /KEY: - 

(3) LOCATION: I . . 18 

(D) OTHER INFORMATION: /not" "TCP1.12 prime: 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO;390:. 
GCTGAAGGCT GAGTGTCC 18 

(2) INFORMATION FOR SEQ ID MO: 391: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/ KEY : - 

<B) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /note- "TCP1.13 primer"- 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 391: 
TAGTCCATGT TCACAATCG 19 



(2) INFORMATION FOR SEQ ID NO: 392: 

(i) SEQUENCE CHARACTERISTICS:- 
(A) LENGTH: 19 base pairs 
(3} TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /note- "TCPl . 16 primer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392: 
TTTCCGTGTT GAGTGTTTC ■ - 19 

'2} INFORMATION FOR SEQ ID MO: 39 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pans 
! H i TYPE : p.uci-3ic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(:.;■;) FEATURE: 

(A) MAMS /KEY: - 

(3} LOCATION: 1 . . 18 

(D) OTHER INFORMATION: /note* "TCPl . 17 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
GTCACCGTGT TGGGCAGG ' '18 
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(2) INFORMATION FOR SEQ .ID NO:394: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

<B) LOCATION : 1 . . 18 

( D ) - OTHER INFORMATION: /note- "TCP1.19 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394: 
GCTACCTGCC CAACACGG 



(2) INFORMATION FOR SEQ ID NO: 39 5: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 



1 . . 1 3 

ORMATI'CN: /note= "TCP1.20 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395: 
G C G C G AAG AA CGTGCTGG 



(2) INFORMATION FOR SEQ ID NO: 3 96: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
{ B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(A) NAME/KEY: 
(3) LOCATION: 
(0) OTHER INF 



(A) NAME/ KEY: - 

(B) LOCATION: 1 , . 19 

(D) OTHER INFORMATION: /note- "TCP1.21 primer" 
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{xi) SEQUENCE DESCRI PT IOlN : SEQ ID MO:396: 
CACTGCTCCT TGTCGCCTG 



(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEONESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i;<) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 19 

(D) OTHER INFORMATION: /note= "TCP1.22 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:397: 
TCCCAAGGA CTTTGTTGC 



(2) INFORMATION" FOR SEQ ID NO:398: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
;C) STRANDEDNESS : single 
ID) TOPOLOGY: linear 

- -;ii) MOLECULE TYPE: DMA 



(Lx; FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 19 

( 0 } OTHER INFORMATION: /note- "TCP 1 * 2 4 prirr.er 



(;<i) SEQUENCE DESCRIPTION: SEQ ID NO: 393: 
TGTTCCTCAA GACGCACTG 



(2) INFORMATION FOR SEQ ID NO: 399: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEONESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . , 19 

(D) OTHER INFORMATION: /note= "TCPl . 25 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: . 
TACTGCGTGC GTCGGTATG 19 



INFORMATION FOR SEQ ID. NO: 400: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 19 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: "1 . . 19 

C'D) OTHER INFORMATION: /note= "TCP1/26 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400: 
GGTCTTGCGG CTGAAGTGT 19 

;2} INFORMATION FOR SEQ ID NO: 4 01: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base" pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY: - 

( B ) LOCATION: 1 . . 19. 

(D) OTHER INFORMATION: /note- "TCP1.27 primer" 

. (xi) SEQUENCE DESCRIPTION: SEQ ID MO: 401: 
TGGTTCACCT GCTGGCACG 19 
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(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 

(C) S T RAN DE ONES S : single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1..19 

(D) OTHER INFORMATION: /note- "TCP1.28 prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
GTGGTTTCTG TGTGGTGTC 



(2) INFORMATION FOR SEQ ID NO: 403: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
(C) STRAMDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



NAME /KEY: - 
LOCATION: i - . 19 

OTHER INFORMATION: /note= "TCP1.29 prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:403: 
G A C AC C AC A C AG AAA C C AC 



(2) INFORMATION FOR SEQ ID NO: 4 04: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
(C) STRAMDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i:-:) FEATURE : 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 19 

(0) OTHER INFORMATION: /note- "TCP1.30 prime 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 0 4 : 
GTGCCAGCAG GTGAACCAG 



(2) INFORMATION FOR SEQ ID MO: 4 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(8 ) LOCATION: 1..19 

(D) OTHER INFORMATION: /note= "TCP1.32B primer" 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO:405: 
GCAGTGCGTC TTGAGGAGC . " 



(2) INFORMATION FOR SEQ ID NO: 406:' 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



. } c 1 -*. l -J rlc : 

(A) NAME /KEY: - 

(3) LOCATION : 1 . . 21 

(D) OTHER INFORMATION: /note= "TCP1-33 prir 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406: 



lGGAACCATA GCGTCAGGGA G 21 



(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE : 

(A) NAME/KEY : - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /note- "TCP1.34 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407: 
GGCCTCCCTG ACGCTATGGT T 



(2) INFORMATION FOR SEQ ID NO: 4 03: 

(i) SEQUENCE CHARACTERISTICS : . 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

{3} LOCATION: 1 . .20 

(D) OTHER INFORMATION : /note= "TCP1 . 35 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403: 
GCKCGGCGCT GCCACTCAGG 



[2) INFORMATION FOR SEQ ID NO: 4 09: 

(i) SEQUENCE CHARACTERISTICS: 
(A)- LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /noce* "TCP1.35t prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 409: 
GCTCGGCGCT GCCACTCAGG 
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(2) INFORMATION FOR SEQ ID MO: 410: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAlME/KEY: - - 

(3) LOCATION : 1 . . 20 

(D) OTHER .INFORMATION : /note- "TCP1.36 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410: 
ACGCCGAGAC CAAGCACTTC 20 

(2) INFORMATION FOR SEQ ID NO : 4 1 1 : 

(i) SEQUENCE CHARACTERISTICS : ' 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
( D) TOPOLOGY : "linear 

( ii) MOLECULE TYPE : DMA 



^AT'JRE: 

'A) MAMS /KEY: - 

{3) LOCATION: 1- - 20 

iO) OTHER INFORMATION: -/noce* "TCP1.38 pnr.~ 



(xi) SEQUENCE DESCRIPTION: SEQ I D MO : 4 1 1 : 
CCAAAGAGG7 GGCTTCTTCG 2 ^ 

(2) INFORMATION FOR SEQ ID MO: 4 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
{3} TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



;ix) FEATURE : 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note- "TCP1 . 39 primer' 
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) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . . 19 

(D) OTHER INFORMATION: /note- "TCP1.42 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415: 
GACGACG TGCTGGTTC 



INFORMATION FOR SEQ ID NO: 4 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DMA 



) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION : 1 . . 19. 

(D) OTHER I N FORMAT I OM : /not:e= "TCP1.43 printer 



(:<:i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 16: 
TCAGGGG CAGCGCCAC 



INFORMATION FOR SEQ ID NO: 4 17: 

'[i) SEQUENCE CHARACTERISTICS: 

(A.) LENGTH: 19 base pairs ~ 
(3) T'i'PE: nucleic acid 
(C) STRANDEDNESS: single 
. ( D } TOPOLOGY: linear 

(11) MOLECULE TYPE: DMA 



FEATURE: 

(A) NAME/ KEY : - 

(8) LOCATION : 1 . . 19 

(D) OTHER INFORMATION: /no:e= "TCP1.44 pritaer 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 417 
SGCAGGTG TACGGCTTC 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:412: 
AAGGCCAGCA CGTTCTTCGC 2 0 



(2) INFORMATION FOR SEQ ID NO: 4 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i:<) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1. . 19 

(D) OTHER INFORMATION: /note- "TCP1.40 primer" 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 13: 
CACGTTCGTG CGGCGCCTG 19 



{2} INFORMATION FOR SEQ ID NO: 4 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 
■:C) STRAMDEDNESS: sincie 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



FEATURE: 

(A) MAlME/KEY: - 

■3) LOCATION : 1 . . 19 

{ D ) OTHER INFORMATION: /note= "TCP1.4 1 o rimer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 4 : 
XTTCACCAC CAGCGTGCG 19 



(2) INFORMATION FOR SEQ ID NO : 4 1 5 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 base pairs 
(3) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(2) INFORMATION FOR SEQ ID NO:418: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 25 

(D) OTHER INFORMATION: /note= "TCPl, 45 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418: 
GCGTGGACCG AGTGACCGTG GTTTC 2 5 



INFORMATION FOR SEQ ID NO : 4 1 9 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 21 base pairs 
(3) TYPE: nucleic acid' 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii) MOLECULE TYPE: DNA 



FEATURE: 

{A) NAME /KEY: - 

(3) LOCATION : 1 . . 21 

(D) OTHER INFORMATION: /note- ' "TCPl . 4 6 primer' 



xi) SEQUENCE DESCRIPTION: SEQ ID MO: 4 19: 

VGG CCGCGATGTG G 21 



■ -J 1 w -J 



(2) INFORMATION FOR SEQ ID NO: 420: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(A) NAME/ KEY: - 

(S) LOCATION : 1 . . 22 

(D) OTHER INFORMATION: /note- "TCPl. 47 primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420: 
GAAGTCTGCC GTTGCCCAAG AG 



(2) INFORMAT ION FOR SEQ ID NO: 421: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 25 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 25 

(D) OTHER INFORMATION:. /note = "TCP1.48 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
GACACCACAC AGAAACCACG GTCAC 



(2) INFORMATION FOR SEQ ID MO: 4 22: 

(i) SEQUENCE CHARACTERISTICS: 
-(A) LENGTH: 21 base pairs 
(3) TYPE:' nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(A) NAME /KEY: - 

(3) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note- "TCPI.49 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422: 
CGCCCCCTCC TTCCGCCAGG T 



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) ' NAME/KEY : - 

(3) LOCATION: 1 . . 25 

{ D ) OTHER INFORMATION: /note- "TCP1 . 50 primer" 

(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 423: 
CGAAGCCGAA GGCCAGCACG TTCTT 2 5 



(2) INFORMATION FOR SEQ ID NO: 424: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION - : 1 . . 22 

(D) OTHER INFORMATION: /note- "TCP1.51 prirr.er" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 
GGTGGCCCGA GTGCTGCAGA GG 2 2 

[2; INFORMATION FOR SEQ ID NO: 4 25: 

{■ii SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
( 5) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1. .25 

(D) OTHER INFORMATION: /note= "TCP1.52 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425: 
GTAGCTGCGC ACGCTGGTGG TGAAG 2 5 
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(2) INFORMATION FOR SEQ ID NO : 4 2 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY : - 

( 3) LOCATION : 1. .22 

(D) OTHER INFORMATION: /note* "TCP1.53 primer" 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 2 6: 
TGGGCGACGA CGTGCTGGTT CA 2 2 



(2) 'INFORMATION FOR SEQ ID NO: 427: 



(i) SEQUENCE CHARACTERISTICS:" 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear " 

(ii) MOLECULE TYPE: DMA 



(A) NAME/ KEY : - 

;=) LOCATION: I . .2 5. 

{ D ) OTHER INFORMATION: /note-" "TCP! . 54 orim 



- (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 2 7 : 
TATGGTTCCA GGCCCGTTCG CATCC 2 5 



(2) INFORMATION FOR SEQ ID NO: 4 23: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single' 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1. .23 

( 0) OTHER INFORMATION: /note= "TCP1.55 primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428: 
CCAGCTGCGC CTACCAGGTG TGC 2 3 

(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: L..2 5 

(0) OTHER INFORMATION: /note- "TCP1.56 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429: 
GGCCTCCCTG ACGCTATGGT TCCAG 2 5 



(2) INFORMATION FOR SEQ ID MO: 4 30: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(u) MOLECULE TYPE: DMA 



(!:■;) FEAT U RE : 

(A) NAME /KEY-: - 

(3) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /noce= "TCP1.57 primer" 

(xi) SEQUENCE. DESCRIPTION: SEQ ID NO:430: 
GGTGCTGCCG CTGGCCACGT TCG 2 3 

(2) INFORMATION FOR SEQ ID MO : 4 3 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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fix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION : 1. .25 

(D) OTHER INFORMATION: /note- "TCP1.58 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:431: 
TCCCAGGGCA CGCACACCAG GCACT 2 5 



(2) INFORMATION FOR SEQ ID NO: 432: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



li>\) FEATURE : 

{A} NAME /KEY: - 

(3) LOCATION: 1 . . 25 

'(D) OTHER INFORMATION: /note- "TCP1.59 primer 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 432: 
GTACAGGGCA CACC-TTTGGT CACTC 



INFORMATION FOR SEQ ID NO: 4 33: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 25 base pairs 
,(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 , , 25 

(D) OTHER INFORMATION: /note- "TCP1.60 prim^ 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:433 
TCGACGACGT ACACACTCAT CAGCC 



25 
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(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii)- MOLECULE TYPE: DMA 



(ix) . FEATURE: 

(A) NAME /KEY : - 

( B ) LOCATION: 1..2 5 

■CD) OTHER INFORMATION: /note* "TCP1.61 primer" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 34: 
AGCGGCAGCA CCTCGCGGTA GTGGC 



(2) INFORMATION FOR SEQ ID NO: 4 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(8) LOCATION : 1 . , 2 5 

(D) OTHER INFORMATION : /ncn« "TCP1.62 primer" 



(:<:i) SEQUENCE DESCRIPTION: SEQ ID MO: 435 
CCACCAGCTC CTTCAGGCAG GACAC 



(2) INFORMATION FOR SEQ ID NO : 4 3 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



{!:•:) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..2 5 

(D) OTHER INFORMATION: /noce= "TCP1.63 primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:436: 
CCAGGGCTTC CCACGTGCGC AGCAG 



(2) INFORMATION FOR SEQ ID MO: 437: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 25 

(D) OTHER INFORMATION: /note- "TCP1.64 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437: 
CGCACGAACG TGGCCAGCGG CAGCA . " 



(2) INFORMATION FOR SEQ ID NO:433: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 23 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION : 1 . . 23 

(D) OTHER INFORMATION: /note= "TCP1.65 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438: 
TGACCGTGGT TTCTGTGTGG TGT 



(2) INFORMATION FOR SEQ ID NO: 4 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1. .25 

(D) OTHER INFORMATION: /note- "TCPl . 66 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439: 
CCCTCTTCAA GTGCTGTCTG ATTCC 25 



(2) INFORMATION FOR SEQ ID NO: 4 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE ; 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note.- "TCPl, 67 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ I D MO : 4 4 0 : 
ATCGCGGCCA CCACGTCCCT 20 



(2) INFORMATION FOR SEQ ID NO : 4 4 I : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 5 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
{ D } TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1. . 25 

(D) OTHER INFORMATION: /note- "TCPl . 68 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 1 : 



TGCTCCAGAC ACTCGGCCGG TAGAA 
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(2) INFORMATION FOR SEQ ID NO : 4 4 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 19 

(D) OTHER INFORMATION: /note= "TCP1.69 primer" 

<xi) SEQUENCE DESCRIPTION: SEQ IDNO:442: 
ACGAAGCCGT ACACCTGCC 19 



(2) INFORMATION FOR SEQ ID NO : 4 4 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

{ A) NAME /KEY : - 

(3)" LOCATION: 1 . . 25 

( D )■ OTHER IN FORMATION : / no ce=» ."TC?1 . 72 primer" 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO:443: 
CGACATCCCT GCGTTCTTGG CTTTC 



(2) INFORMATION FOR SEQ ID MO : 4 4 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

■ (A) NAME /KEY : - 

(8) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note= "TCP1.73 primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 4 : 



CACTGCTGGC CTCATTCAGG G 2 1 



(2) INFORMATION FOR SEQ ID NO: 44 5: 



(i) SEQUENCE CHARACTERISTICS: 

(A) - LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(BJ LOCATION: 1 . . 18 

(D) OTHER INFORMATION: /note- "TCP1.74 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 4 4 5 : 



;atgga GAACAAGC . 18 



(2) IN FORM A.? ION FOR SEQ ID M.O : 4 4 6 : 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(5) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



Tim) FEATURE! : 

(A) NAME /KEY: - 

(3) LOCATION: 1.-.20 

(0) OTHER INFORMATION: /noce= "TCPI.75 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 6 : 
GCAGCCATAC TCAGGGACAC 2 0 



(2) INFORMATION FOR SEQ ID NO : 4 4 7 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note* "TCP1.76 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447: 
CCATCCTCTC CACGCTGCTC 



(2) INFORMATION FOR SEQ 10 NO: 4 48: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY:' - 

(3) LOCATION: 1. . 21 . ' • 

CD) OTHER INFORMATION: / no tie- " TC P 1 . 7 7 primer' 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 448: 
GCGATGACCT CCGTGAGCCT G - 



; 2 i I N FORMATION FOR SEQ ID NO : 4 4 9 : 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 base pairs 
(3) TYPE.: nucleic acid 

(C) STRAND EON ESS : single 

(D) . TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE : 

(A) NAME/ KEY: - 

(3) LOCATION: 1. . 19 

(D) OTHER INFORMATION: /note- "TCP1.73 primer" 



(xi). SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 9 : 
CCCAGGACAG GCTCACGGA 
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2) I NFORMIAT ION FOR SEQ ID NO: 450: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..24 

(0) OTHER INFORMATION: /note= "billTCPl primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:450: 
CCTCTTCAAG TGCTGTCTGA TTCC 



{2) INFORMATION FOR SEQ ID NO:451: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1. .25 

■■0) OTHER INFORMATION: /note- "billTCPZ. primer 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO : 4 5 1 : 
CAGCTCGACG AC GT AC AC AC TCATC 



(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /note= "biiITCP4 primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 
CTGACGTCCA GACTCCGCTT CAT 2 3 

(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..33 

(D) OTHER INFORMATION: /note- "rpprimOl primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:453: 
GACCTGAGCA GCTCGACGAC GTACACACTC ATC . " 33 

(2) INFORMATION FOR SEQ ID NO: 454: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) ST RAM DE DM ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1. . 20 

(D) OTHER IN FORMAT ION: /note- "Ltl primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454: 
GTCGTCGAGC TGCTCAGGTC 20 



(2) INFORMATION FOR SEQ ID NO: 4 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note- "Lt2 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 
AGCACGCTGA ACAGTGCCTT 



(2) INFORMATION FOR SEQ ID NO: 4 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- : 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . .20 

(D) OTHER INFORMATION: /note- "Lt3 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456: 
GACCTGAGCA GCTCGACGAC 



-(2} INFORMATION FOR SEQ ID MO:457; 

(i) SEQUENCE -CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) ST RAND EDM ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 20 

(D) OTHER INFORMATION: /note= "Lt4 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:457: 
AAGGCACTGT TCAGCGTGCT 
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(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .20 

(D) OTHER INFORMATION: /note- " Lc 5 primer" 
(xi) SEQUENCE" DESCRIPTION: SEQ IDNO:453: 

20 

CGGCCGAGTG TCTGGAGCAA 

(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1"9 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
( 0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA - 



) FEATURE: 

(A) NAME /KEY: - 

( 3 ) LOCATION :- 1 . . 19 

(Q) OTHER INFORMATION: /note= "Lc6 primer" - 
(xi) SEQUENCE. DESCRIPTION: SEQ ID MO: 4 59 : 

19 

VTGAAGCG GAGTCTGGA 



(2) INFORMATION FOR SEQ ID NO:460: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 29 base pairs 
(S) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(Q) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1- - 29 . , f 

(D) OTHER INFORMATION: /note- "BamHlLt7 primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 
ATGGATCCGT CGTCGAGCTG CTCAGGTCT 



(2) INFORMATION FOR SEQ ID NO: 4 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME/ KEY : - 

(B) LOCATION: 1..29 

(D) OTHER INFORMATION: /note- M SallLt:8 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:461: 
ATCAGCTGAG CACGCTGAAC AGTGCCTTC 



(2) INFORMATION FOR SEQ ID NO: 4 62: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 24 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(8) LOCATION: 1..2 4 

(D) OTHER INFORMATION: /note= "K303 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462: 
GTCTCCGTGA CATAAAAGAA AG AC 

(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note- "K304 primer" 
' (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463: 

20 

GCCAAGTTCC TGCACTGGCT 

(2) INFORMATION FOR SEQ 10 NO: 464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) ST RAN D£ DM ESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1.-24 

(D) OTHER INFORMATION : /note- "K305 primer' 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464: 

24 

GCCTGTTCTT TTGAAACGTG GTCT * 

(2) INFORMATION FOR SEQ ID NO: 465: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: -24 base pairs 
(3) TYPE: nucleic acid 
(C~) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DMA 



(i;<) FEATURE : 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 24 

(0) OTHER INFORMATION: /note- "K306 primer" 



(ix) FEATURE: 

(A) NAME/ KEY : modi f ied_base 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 



/not" " N = guanosiae substituted by two 
biotin groups" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 
NCCTGTTCTT TTGAAACGTG GTCT 
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(2) INFORMATION FOR SEQ ID NO: 4 66: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:- DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. .22 

(D) OTHER INFORMATION: /note* "K311 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466: 
GTCAAG AT GC CTGAGATAGA AC 



(2) INFORMATION FOR SEQ ID NO:.4 67 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



; Lx) - FEATURE : 

(A) NAME /KEY: - 

(3) LOCATION :. 1 . . 22 

{ D) OTHER INFORMATION: /not:e = "K312 and K313 primers" 



(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 4 6 7 : 
TGCTTAGCTT. GTGGGGGTGT CA 



(2) INFORMATION FOR SEQ ID NO:4 68: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..21 

CD) OTHER INFORMATION: /note- "K320 primer" 
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{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
GCTGCGTCCT GCTGCGCACG T 



(2) INFORMATION FOR SEQ ID NO: 4 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME / KEY ; - . 

(BJ LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note=»- "K321 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 4 69: 
- CAGCGGGGAG CGCGCGGCAT C . 



(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pai-rs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

{ii} MOLECULE TYPE: DNA - 



FEATURE: . " 

(A) NAME /KEY : - 

" (3) LOCATION : 1 . '. 21 

( D ) OTHER INFORMATION: /note- "K322 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:470: 
TGGGCCACCA GCGCGCGGAA A 



(2) INFORMATION FOR SEQ TD NO : 4 7 1 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 25 

(D) OTHER INFORMATION: /rvote- "slanti.l primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471: 
CGGCCGCAGC CCGTCAGGCT TGGGG 



(2) INFORMATION FOR SEQ ID NO: 472: 

(1) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
{ D) TOPOLOGY : linear 

(ii) MOLECULE TYPE : DMA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . . 24 

(D) OTHER INFORMATION: /nore= "slant;!. 2 primer" 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO: 472: 
CCGACAGCTC CCGCAGCTGC ACCC 



INFORMATION FOR SEQ ID NO : 4 7 3 : 

(L) SEQUENCE CHARACTERISTICS: 
(A~j LENGTH: 3 4 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . .34 

(D) OTHER INFORMATION: /note= "slanti.3 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 473: 
CGTACACACT CATCAGCCAG TGCAGGAACT TGGC 
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(2) INFORMATION FOR SEQ ID NO: 474: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 46 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A.) NAME /KEY : - 

(3) LOCATION: 1 . . 46 

(D) OTHER INFORMATION: /note- "slanti.4 prim 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 74 : 
CGCGCCCGCT CGTAGTTGAG CACGCTGAAC AGTGCCTTCA CCCTCG 



(2) INFORMATION FOR SEQ ID NO : 4 7 5 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
( D } TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(A) NAME/ KEY: - 

[31 LOCATION: 1 . . 39 

(D) OTHER INFORMATION: /note 3 "slanti.S prir 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 475: 
GCGGAGTCTG GACGTCAGCA GGGCGGGCCT GGCTTCCCG 



(2} INFORMATION FOR SEQ ID MO:476: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 27 base pairs 
{3} TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



ix} FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1 . .27 

(D) OTHER INFORMATION: /note= "UTR2 primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 6 : 
ATTTGACCCA CAGGGACCCC CATCCAG 



(2} INFORMATION FOR SEQ ID MO: 477: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 20 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY.: linear 

(ii) MOLECULE TYPE: DMA 



(i:<) FEATURE: 

(A) NAME /KEY : - 

(8) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note- " FW5 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 77: 
ATGACCGCCC TCCTCGTGAG- 



(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTER I ST ICS : 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid. 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE!: DMA 



FEATURE : . 

(A) NAME/ KEY : - 

(B) LOCATION : 1 . . 17 

(D) OTHER INFORMATION: /note- "Marul prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 73: 
GCCACCCCCG CGATGCC 



(2) INFORMATION FOR SEQ ID NO: 47 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



282 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 17 

(D) OTHER IN FOREST ION: "/note* "Nam2 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479: 
AGCCCTGGCC CCGGCCA 



(2) INFORMATION FOR SEQ ID NO: 430: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION : 1 . . 17 

(D) OTHER INFORMATION: /note- "Mam3 primer" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID MO: 4 80: 
TCCCACGTGC GCAGCAG 



(2) INFORMATION FOR SEQ ID NO : 4 3 I : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 17 base pairs 
(3) TYPE: nucleic acid 
.(C) STRANDEDMESS : single 
(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 17 

(D) OTHER INFORMATION: /note- "Mam4 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431: 
AGCAGGACGC AGCGCTG 
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(2) INFORMATION FOR SEQ ID NO: 482: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs - 

(B-) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ■ 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 34' 

(0) OTHER INFORMATION: /note- **PE01 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482: 
CGCGGTAGTG GCTGCGCAGC AGGGAGCGCA CGGC 34 

(2) INFORMATION FOR SEQ ID NO: 4 33: 

(if SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucheic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



:) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 35 

(D)~ OTHER INFORMATION: /n.o:e= "PE02 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 3 3 : 
-VGGGCTTC CCACGTGCGC AGCAGGACGC AGCGC 35 



[2] INFORMATION FOR SEQ ID NO: 4 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 
'(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: ~ 

(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: /noce= "LM101 ? 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:484: 
CTAGTCTAGA TCRCTAGCGT AATCTGGAAC ATCGTATGGG TRTCCAGGAT GGTCTTGAAG 

TC 

(2) INFO RMATION EOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 39 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . . 39 

(0)- OTHER INFORMATION: /note- "LM103 primer 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 
TACCATGGGC TACCCATACG ACGTTCCAGA TTACGCTCA " 



(2) INFORMATION EOR SEQ ID MO: 436: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
;3) TYPE: nucleic acid 
■■C) STRANDEDMESS : single 
,0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 39 , 

(D) OTHER INFORMATION:- /note- "LM104 primer 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 8 6 : 
TATGAGCGTA ATCTGGAACG TCGTATGGGT AGCCCATGG 



(2) INFORMATION FOR SEQ ID MO: 487: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 45 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 45 

(D) OTHER INFORMATION: /note- "LM105 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 
GTGTACGTCG TCGAGCTCCT CAGGTCTGCC TTTTATGTCA CGGAG 



(2) INFORMATION FOR SEQ ID NO:488: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 48 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 48 

(D) OTHER INFORMATION: /note= "LM106 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 4 8 8 : 
GTGTACGTCG TCGAGCTCCT CAGGTCTTTC GCTT ATGTCA CGGAG AC C 



2) INFORMATION FOR SEQ ID NO : 4 8 9 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:<) FEATURE : 

(A) NAME /KEY: - 

(8) LOCATION: I . . 4 8 ■ 

(D) OTHER INFORMATION: /note= "LM1Q7 prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489: 
CCTCAGGTCT TTCTTTGCTG TCACGGAGAC AACGTTTCAA AAGAACAG 
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(2) INFORMATION FOR SEQ ID NO: 4 90: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
(Dj TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:<) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1. .43 

(D) OTHER INFORMATION: /note- "LM108 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 490: 
GGTCTTTCTT TTATGTCGCG GAGACAACGT TTCAAAAGAA CAG 



(2). INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: DMA 



NAME /KEY: - 
LOCATION : . 1 . . 4 0 

OTHER INFORMATION: /note- "LMI09 primer 



. (v.i) SEQUENCE DESCRIPTION: .SEQ ID NO : 4 9 1 : 
CTTTCTTTTA TGTCACGGCG ACAACGTTTC AAAAGAACAG 



(2) INFORMATION FOR SEQ ID MO: 4 92: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(iv) feature:: 

(A) NAME/KEY: - 

(3) LOCATION : 1 . . 63 

(D) OTHER .INFORMATION : /note- *'LM_FFYTE pri 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 4 92; 
ATGAGTGTGT ACGTCGTCGA GCTCCTCAGG TCTACCACGT TTCAAAAGAA CAGGCTCTTT 



(2) INFORMATION FOR SEQ ID NO: 493: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
{3} TYPE: nucleic- acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 25 

(D) OTHER IN FORMATION : /note^ "TCP061 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493: 
GGCTGATGAG TGTGTACGTC GTCGA 



(2) INFORMATION FOR SEQ ID MO: 4 94: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(3) TYPE: nucleic acid 
iC) STRANDEDNESS: single 
' 0 ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 



(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 25 

(D) OTHER INFORMATION: /note= "HUMOl primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494: 
ACGTGGTCTC- CGTGACATAA AAGAA 



(2) INFORMATION FOR SEQ ID MO: 4 95: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURS : 

(A) NAME/KEY: - 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION : /note- "HUM02 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:495: 
AGGTCTTTCT TTTATGTCAC GGA 



(2) INFORMATION FOR SEQ ID NO: 496: 

(1) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) * STRANDEDNESS : single 
{ D } TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 22 

(0) OTHER INFORMATION: /noCe= "HUM03. primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496: 
CACAGACCCC CGTCGCCTGG TC 



- (2) INFORMATION FOR SEQ ID MO: 4 97: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE : 

(A) NAME/ KEY : . - 

(B) LOCATION: 1. .23 

(D) OTHER INFORMATION: /.note = "HUM04 primer" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 4 97: 
CGGAGTCTGG ACGTCAGCAG GGC 
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(2) INFORMATION FOR SEQ ID NO: 498: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE : , nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA . 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..3 9 

(D) OTHER INFORMATION: /note- "SLW FIN primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 98 : 
CGCGGATCCG TAACTAAAAT GCCGCGCGCT CCCCGCTGC 



(2) INFORMATION FOR SEQ ID NO: 499: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(A) NAME /KEY: - 

(3) LOCATION : 1 . . 42 

(0) OTHER INFORMATION: /not:e = "SLW F1C primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 499: 
CCGGAATTCG TTAGTTACTT ACAAAGAGGT GGCTTCTTCG GC 



(2) INFORMATION FOR SEQ ID NO: 5 00: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(Ik) FEATURE : 

(A) NAME /KEY: - 

O) LOCATION: 1. . 39 

(D) OTHER INFORMATION: /note= "SLW F2N primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 500: 
CGCGGATCCG TAACTAAAGC CACCTCTTTG GAGGGTGCG 3 9 



(2) INFORMATION FOR SEQ ID NO: 501: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH:- 42 base pairs 
.(B) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



<ix) FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1 . . 42 

(D) OTHER INFORMATION: /note- "SLW F2C primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
CCGGAATTCG TTAGTTACTT AAGACCTGAG CAGCTCGACG AC 



(2) INFORMATION FOR SEQ ID NO: 502: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 39 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA. 



FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION: 1 . . 39 

(D) OTHER INFORMATION: /note~ "SLW F3N primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502: 
CGCGGATCCG TAACTAAAAT GAGTGTGTAC GTCGTCGAG 



(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : . 40 base pairs 
(BJ TYPE: nucleic acid 

(C) ' STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DMA 



(ix) . FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 42 

(D) OTHER INFORMATION: /note= "SLW E3C primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503; 
CCGGAATTCG TTAGTTACTT AGATCCCCTG GCACTGGACG 



(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3) LOCATION : 1 . . 39 

(D) OTHER INFORMATION: /note- "SLW F4M primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:504: 
CGCGGATCCG' TAACTAAAAT CCCGCAGGGC TCCATCCTC 



(2) I N FORMAT I ON FOR SEQ ID MO: 50 5: 

■ (i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : " 4 2 base pairs 
(3) TYPE: nucleic acid 

(C) 'STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ i i ) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1..4 2 

(D) OTHER INFORMATION: /note* "SLW E4C primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505: 



CCGGAATTCG TTAGTTACTT AGTCCAGGAT GGTCTTGAAG TC 
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(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "phosphorot hioa te " 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . ; 21 

(D) OTHER INFORMATION: /note= "40-60 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:506: 



GGCATCGCGG GGGTGGCCGG G 
(2) INFORMATION FOR SEQ ID NO: 5 07: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " phosphoro thioa t e " 



lix) FEATURE: 

(A) NAME/ KEY: - 

{ B ) LOCATION: 1 . . 21 

{D)~ OTHER INFORMATION: /note= "260-230 prime 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507: 
GGACACCTGG CGGAAGGAGG G 



(2} INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
_{B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc - " phosphorot hioa te " 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1..21 

(D) OTHER INFORMATION: /note= "500-520 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO-: 508: 
GCGTGCCAGC AGGTGAACCA G 



(2) INFORMATION FOR SEQ ID NO: 509: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc » "phosphorothioate " 



(ix) FEATURE: 

(A) NAME/ KEY : 

(B) LOCATION : 1 . . 21 

(D) OTHER INFORMATION: -/note- "770-790 primer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 
CTCAGGGGCA GCGCCACGCC T 

[2) INFORMATION FOR SEQ ID MO: 510: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 21 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc '* phosphorot hioa t e ' 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. .21 

(D) 'OTHER INFORMATION : /note* "885-905 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 510: 
AGGTGGCTTC TTCGGCGGGT C 



294 



(2) INFORMATION FOR SEQ ID NO : 5 1 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "phosphorothioate" 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .21 

(D) OTHER INFORMATION: /note= "1000-1020 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511: 
GGACAAGGCG TGTCCCAGGG A 



(2) INFORMATION FOR SEQ ID NO: 512: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
{3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "phosphorothioate" 



{!:•:) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . .21 

(DTOTHER INFORMATION: /note^ "1300-1320 primer" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 512: 
GCTGGGGTGA CCGCAGCTCG C 



(2} INFORMATION FOR SEQ ID MO: 513: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(3) TYPE : nucleic acid 
'(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ^ "phosphorothioate" 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note= "1520-154-0 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513: 
GATGAACTTC TTGGTGTTCC T 



(2} INFORMATION FOR SEQ ID NO: 514: 

(i) SEQUENCE CHARACTERISTICS : 

(A) -LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "phosphoro thioa te " 



(i:-:) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note= "2110-2130 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514: 
GTGCGCCAGG CCCTGTGGAT A 



[2] INFORMATION FOR SEQ ID NO : 5 1 5 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 1 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "phosphoro thioa te 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note= "2295-2315 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO : 5 1 5 : 
GCCCATGGGC GGCCTTCTGG A 
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(2) INFORMATION FOR SEQ ID MO : 5 1 6 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "phosphoro thioa te ' 



(ix) FEATURE: 

(A) NAME/KEY: - 

(8) LOCATION :■ I. .21 

(D) OTHER INFORMATION : /note- "2450-2470 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 6 : 



GAGGCCACTG CTGGCCTCAT T 



(2) INFORMATION FOR SEQ ID NO: 517: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ~ "phosphoro thioa te " 



:-:) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 21 

{ D ) OTHER INFORMATION: /note= "2670-2690 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5 17: 
GGGTGAGGTG AGGTGTCACC A 



(2) INFORMATION FOR SEQ ID NO: 518 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: .other nucleic acid 

(A) DESCRIPTION: /desc « "phosphoro thioa te " 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. .31 

(D) OTHER INFORMATION: /note- "3080-3110 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518: 
GCTGCAGCAC ACATGCGTGA AACCTGTACG C . 31 



(2) INFORMATION FOR SEQ ID NO: 5 19: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc « "phosphorothioa t e ' 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note- "3140-3160 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519: 
GACGCGCAGG AAAAATGTGG G 21 



(2) INFORMATION FOR SEQ ID NO:520:. 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(3} TYPE: nucleic acid 

(C) STRANDEDMESS: single 

( D ) TOPOLOGY: linear' 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "phosphorothioa te " 



(A) NAME/KEY: - 

(B) LOCATION: 1 . .21 

(D) OTHER INFORMATION: /note= "3690-3710 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520: 
CCG^GCGCCh GCCTGTGGGG A 



21 
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(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "phosphorothioa te 



<ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 21 ■ 

(D) OTHER INFORMATION: /note- "55-75 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521: 
CAGCGGGGAG CGCGCGGCAT C 



(2} INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: -21 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
{DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = " phosphorot hi oa t e " 



(A) NAME/KEY: - 

(3) LOCATION: 1 . . 21 

(0} OTHER INFORMATION: /note- "151-171 prirr ; 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522: 
CAGCACCTCG CGGTAGTGGC T 



(2) INFORMATION FOR SEQ ID NO: 523: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION: /note- "TP1 . 1 primer 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: 
TCAAGCCAAA CCTGAATCTG AG 2 2 



(2) INFORMATION FOR SEQ ID NO: 524: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - . 

(3) LOCATION: 1..21 

CD) OTHER INFORMATION: /note* "TP1.2 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:524: 
CCCGAGTGAA TCTTTCTACG. C 21 



(2) INFORMATION FOR SEQ ID MO: 525: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 



(A) NAME/KEY: - 

(S) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /note- "TP1.3 primer 



(:<i) SEQUENCE DESCRIPTION: SEQ ID MO:525: 
GTCTCTGGCA GTTTCCTCAT CCC 2 3 



(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:, DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 22 

(D) OTHER INFORMATION : /note- "TP1.4 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526: 
TTTAGGCATC CTCCCAAGCA CA 22 



\2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(xi)" SEQUENCE DESCRIPTION: SEQ ID NO:527: 
7AGGGTTAG 10 

INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(3) TYPE: nucleic acid 
[Z) STRANDEDNESS : single ■ 
■C) TOPOLOGY: linear 

; ii) :-::iECULE type: dma 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: 
7AGGGTTAG GGTTAGGG 13 



INFORMATION FOR SEQ ID NO: 529: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1S V base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 2 9 : 
GT7AGGGTTA GGGTTAGG 13 
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(2) INFORMATION FOR SEQ ID NO: 530.: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:<) FEATURE: 

(A) NAME/ KEY : repeat_unit 

(B) LOCATION: 1. . 6 

( D) OTHER INFORMATION: /note* ''sequence (CCCTAA)-n, where n is 

at least L, or at least 3, or -at least 
10 or more" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530: 
CCCTAACCCT AACCCTAACC CTAACCCTAA CCCTAACCCT AACCCTAACC CTAACCCTAA 60 



(2) INFORMATION FOR SEQ ID NO:'5 31:. 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 34 base pairs 
(3) TYPE: nucleic acid 

(C) ' STRANDEDNESS: single' 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: dna 



(::-:) FEATURE: 

(A) NAME /KEY: mis cofeature 
(3) LOCATION: L . . 30 

(D) OTHER INFORMATION: /noHe= " non- t e Lome r ic nucleotide 

sequence, (X)-n, where n is 3-20, 
or '6-30" 



(mi) SEQUENCE DESCRIPTION: SEQ ID MO : 5 3 1 : 
NNNNNNM NMM NNNNNNNNNN NNNNNN N MNN TTAG 

(2) INFORMATION FOR SEQ ID NO: 5 32: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 34 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME /KEY : misc_f ea ture 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note* "non-telomeric nucleotide 

sequence, (X)-n, where n is 8-20, 
or 6-30" 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
MMMNNNNNNN MNNNNNNNNN NMNNNNNNNN AGGG 



(2) INFORMATION FOR SEQ ID NO: 533: 

(i)- SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : misc_£ea ture 
(3) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note 3 "non-telomeric nucleotide 

sequence, (X)-n, where n is 3-20, 
or*6-30" 



(xi) sequence description: seq id no: 533: 
nnmnm:: unnnnmnmnn mmnmnmnnmm ttagggttag 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 46 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : misc_ f ea tu re 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note^ "non-telomeric nucleotide 

sequence, (X)-n, where n is 8-20, 
or 6-30" 



(xi) SEQUENCE DESCRIPTION: 5£Q ID NO: 534: 
NMNNNNNNNN NMNNNNNNNN NNNMNNN NMN TTAGGGTTAG GGTTAG 
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(2) INFORMATION FOR SEQ ID NO: 535; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : misc_f eature 
CB) LOCATION: 1 . .30 

(D) OTHER INFORMATION: /note- "non-telomeric nucleotide 

sequence/ {X)-n, where n is 8-20, 
or '6-30" 



(xi) SEQUENCE DESCRIPTION: SEQ ID'NO:535: 
iMNNNMNNMN NNNMMNNM NN MNNNNNMNNN TTAGGGTTAG GGTTAGGGTT AG 



!;. INFORMATION FOR SEQ ID MO: 53 6.: 

(i) SEQUENCE CHARACTERISTICS:' 
(A) LENGTH: 53 base pairs 
(3) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: ' DNA 



(AJ NAME/KEY; mi sc_5ea t ure 
( 3} LOCATIO;-!: I . . 30 

CD) OTHER INFORMATION: /note= "non- te lome r ic nucleotide 

sequence, (X)-n, where n is 8-20, 
or 6-30" 



(:<i) SEQUENCE DESCRIPTION': SEQ ID NO: 536: 
■:MNMMMMNMN NNNNMN M N MM NMNNMNNNNM TTAGGGTTAG GGTTAGGGTT AGGGTTAG 



(2) INFORMATION FOR SEQ ID NO: 5 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) L&MGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 10 

(D) OTHER INFORMATION; /note= "primer terminating at template 

boundary" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 537: 
TAGGGATTAG ' ' 10 



(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS : 
.(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
( D } TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:-:) FEATURE: 

(A) NAME /KEY : - . 
(3) LOCATION: 1 . . 16 

(D) OTHER INFORMATION: /ao:e= "primer for hTRT nueleolytic 

assav" 



FEATURE: 

(A) NAME /KEY: modi f i ed_base 
(3) LOCATION: 16 

(D) OTHER INFORMATION: /mod_base- OTHER 

/note- "N - 3 ' -deo:-:yguanos in* 



SEQUENCE DESCRIPTION: SEQ I D NO: 533 : 
I- GGTT AG GGTTAN . 16 



[2) INFORMATION FOR SEQ ID NO: 539: 

(1) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 60 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY : repeat_unit 

( B ) LOCATION ; 1..6 

( D ) OTHER INFORMATION: /note= "sequence ( TTAGGG ) -n, where n 

is 1-10, or typically 3-5" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:539: 
TTAGGGTTAG GGTTAGGGTT AGGGTTAGGG TTAGGGTTAG GGTTAGGGTT AGGGTTAGGG 



(2) INFORMATION FOR SEQ ID NO: 540: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:540: 

Trp Xaa Gly Xaa Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 54 1: 

(i) SEQUENCE CHARACTERISTICS : ' 
(A) LENGTH: 4 amino acids 
{3} TYPE: amino acid 
(C) STRANDEDNESS: 
{ D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



[y.L) SEQUENCE DESCRIPTION: SEQ I D NO: 54 1: 
?:o Arg Arg Arg 



(2) INFORMATION FOR SEQ ID NO : 5 4 2 : 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS : 
{ D ) . TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542: 

Pro Lys Arg Pro Arg Arg 
1 5 
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(2) INFORMATION FOR SEQ XD MO: 54 3: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) SAME/KEY: - 

(B) LOCATION: 1 . . 18 

(D) OTHER INFORMATION: /note- "antisense hTRT molecule" 

Cxi) SEQUENCE DESCRIPTION: SEQ ID MO : 5 4 3 : 
GCTCTAGAAT GAAGGGTG 



(2) INFORMATION FOR SEQ ID MO: 54 4: 

(i) SEQUENCE CHARACTERISTICS: 
{ A) LENGTH: 21 base pairs 
(3) TYPE: nucleic. acid 
CO STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION : I . .21 

CD) OTHER I M FORMAT ION : /note= "SI control oligonucleotide" 



(xi) sequence description: seq id no : 5 4 4 : 
g::-acgactg acattggccg g 



( 2) INFORMATION FOR SEQ ID MO: 54 5: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(!;<:> FEATURE : 

(A) NAME /KEY: - 

(3) LOCATION : 1 . . 21 

(D) OTHER INFORMATION: /note= "S2 control oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545: 
GGCTCGAAGT AGCACCGGTG C 



(2) INFORMATION FOR SEQ ID NO:546: ' 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(BJ LOCATION: 1 . . 21 

(D) OTHER INFORMATION: /note- "S3 control oligonucleotide" 

(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 546: 
GTGGGAACAG GCCGATGTCC C 



(2) INFORMATION FOR SEQ ID MO: 54 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
. (3) TYPE: nucleic acid 
(C) STRANDEDMESS : single 
( 0) TOPOLOGY; linear 

( ii) MOLECULE TYPE: DNA 



(A) NAME /KEY: - 

(3) LOCATION : 1 . . 30 

(0) OTHER INFORMATION:' /note* "ON I oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 
CCCACGTGC GCAGCAGGAC GCAGCGCTGC 



2) INFORMATION FOR SEQ ID MO: 54 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDMESS : single 
(Q) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note- "ON2 oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 4 8 : 
GCCGGGGCCA GGGCTTCCCA CGTGCGCAGC 



(2} INFORMATION FOR SEQ ID NO: 54 9: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
(3) T-YPE: nucleic acid 

(C) STRANDEDNESS: single" 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) ' NAME /KEY : - 
(3)' LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note*- "ON3 oligonucleotide" 



■{xi) SEQUENCE DESCRIPTION: SEQ ID MO : 5 4 9 : 
GGCATCGCGG GGGTGGCCGG GGCCAGGGCT 



INFORMATION FOR SEQ ID NO: 550: 

(L) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS :- single 
(0) TOPOLOGY: linear 

{ii) N.OLECULE TYPE: DMA 



FEATURE : 

(A) NAME/ KEY: - 

(3) LOCATION : 1 . . 20 

(D) OTHER INFORMATION: /note= "delta-182 species spec 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:550: 



GGCACTGGAC GTAGGACGTG 
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(2) INFORMATION FOR SEQ ID NO: 551: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH : 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 20 

(D) OTHER INFORMATION: /note= "293 cell line lambda cDNA 

library PCR primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:551: 
CGGAAGAGTG TCTGGAGCAA 20 

(2) INFORMATION FOR SEQ ID NO: 552: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(A) NAME /KEY: - 

(3) LOCATION : i . . 2 3 

{ o ) OTHER INFORMATION: /note= "GAPDH control prirr.er K136" 

(:-:i) SEQUENCE DESCRIPTION: SEQ ID NO: 552: 
CXCAGACACC ATGGGGAAGG TGA 23 



(2) INFORMATION FOR SEQ ID NO:553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(A) NAME/KEY: - 

(B) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /note- "GAPDH control primer K137" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 553: 
ATGATCTTGA GGCTGTTGTC ATA 



(2) INFORMATION FOR SEQ ID NO : 5 5 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION^ 1 . . 31 

"(D) OTHER INFORMATION: /note* "competitor oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID '.NO: 554: 
TAGACCTGTT AGTGTACATT TGAATTGAAG C 



(2) INFORMATION FOR SEQ ID NO: 5 55: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(0) TOPOLOGY": linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note= "competitor oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55 5: 
TAGACCTGTT AGGTTGGATT TGTGGCATCA 



(2) INFORMATION FOR SEQ ID NO: 5 56: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



311 



Ik) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . . 26 

(D) OTHER INFORMATION : /note- "biot i n- labe led 

telomerase- specific oligonucleotide 



(ix) FEATURE: 

(A) NAME/KEY: modi f ied_base 
.(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note- "N - thymidine modified by a 
biotin group" 

(ix) FEATURE: 

(A) NAME /KEY : modi f ied_base 
"(B) LOCATION: 12 . - 13 
(D) OTHER INFORMATION: /mod_base= gm 

(ix) FEATURE; 

(A) NAME/KEY: modi f i ed_ba s e 
(3) LOCATION: 14. .17 

(0) OTHER INFORMATION: /mod_base= urn 

(ix) FEATURE: 

"(A) NAME /KEY : modi f iedjoase 
(3) LOCATION : 18 . . 21 

CD) OTHER INFORMATION: /mod_base= gm 

(ix) FEATURE: 

(A) MAMS/KEY: modi £ ied_base 
(3) LOCATION : 2 2. .25 

( D ) OTHER INFORMATION: /mod_base= urn 

(ix) FEATURE: 

' (A) NAME/KEY: modi fi ed_oa se 
■ (3) LOCATION: 26 
(0) OTHER INFORMATION: /mod base = gm 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 5 6 : 
[AGACCTGTT ANMNMMN NNM NNNNNN 



(2) IlMFORlMATION FOR SEQ ID NO: 557: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs 
(3) TYPE: nucleic acid 
(C) S T RAN D E DM E S S : single 
CD) TOPOLOGY: linear 

(Li) MOLECULE TYPE: DNA 



(A) NAME /KEY: - 

(3) LOCATION: 1 . .31 

(0) OTHER INFORMATION: 



/note= "displacement 
deoxyol igonucleo t ide " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:557: 
CAAAACCCCA AAACCTAACA GGTCTA 26 

(2) INFORMATION FOR SEQ ID NO : 5 5 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 base pairs 

(8) TYPE: nucleic acid • 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(i;<) FEATURE: 

(A) . NAME/KEY: - 

(B) LOCATION: 1..103 

(D) OTHER INFORMATION: /note- "5' primer encoding a 

hammerhead ribozyme, a T7-promoter and 
an EcoRl site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558: 
GCGGGAATTC TAATACGACT CACTATAGGG AAGAAACTCT GATGAGGCCG AAAGGCCGAA ' 60 

ACTCCACGAA AGTGGAGTAA GTTTCTCGAT AATTGATCTG TAG 103 

(2) INFORMATION FOR SEQ ID NO: 559: 

(i) SEQUENCE CHARACTERISTICS: 
(A)" LENGTH : 36 base pairs 
(3) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
( 0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(i;<) FEATURE: 

(A) NAME /KEY: - 

■(B) LOCATION: I. .36 „ - - h(a 

(0) OTHER INFORMATION: /note= "3' primer including an carl site 

and a BamHl site" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

3 6 

CGGGGATCCT C T T C AAAAG A TGAGAGGACA GCAAAC 



(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
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(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 60 

(D) OTHER INFORMATION: /note= "hairpin primer" 



(Ki) SEQUENCE DESCRIPTION: SEQ ID NO:560: 
CCCCAAAACC CCAAAACCCC AAAACCCCCA CAGGGGTT.TT GGGGTTTTGG GGTTTTGGGG 60 



(2) INFORMATION FOR SEQ ID NO: 561: 

-U) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 58 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(lk) FEATURE: 

(A) NAME /KEY: - 

(3} LOCATION: 1..5 3 

'(D) OTHER INFORMATION: /note* "hairpin primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 6 L : 
CCAAAACCCC AAAA.CCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTTGGGG 53 



f 2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 56 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(i:<) FEATURE: . 

(A) NAME/KEY: - 

(3) LOCATION : 1 . . 5 6 

(D) OTHER INFORMATION: /note= "hairpin primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 
AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTTTGGGG TTTTGGGGTT TTGGGG 



56 
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(2) INFORMATION FOR SEQ ID MO: 563: 

(i) SEQUENCE CHARACTERISTICS: 
- (A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) . LOCATION: 1 . . 54 

(D) OTHER INFORMATION: /note- "hairpin primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 563: 
AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT GGGG 



{2) INFORMATION' FOR SEQ ID NO:564: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



NAME /KEY: - 
LOCATION: 1 . .43 

OTHER INFORMATION; /n.o:e= "hairpin primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 
CCCCAAAACC CCAAAACCCC CACAGGGGTT TTGGGGTTTT GGGGTTTT 



(2) INFORMATION FOR SEQ ID MO: 5 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 52 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: I . . 52 

(D) OTHER INFORMATION: /note= "hairpin primer" 



(A) 
(3) 
(D) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 
AAAACCCCAA AACCCCAAAA CCCCCACAGG GGTTXTGGGG TTTTGGGGTT TT 



(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1 . . 50 

(D) OTHER INFORMATION: /note= "hairpin primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566: 
AACCCCAAAA CCCCAAAACC CCCACAGGGG TTTTGGGGTT TTGGGGTTTT 



(2) INFORMATION FOR SEQ ID NO: 5 67: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH : 46 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(Q) TOPOLOGY: linear 

( ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

- (A) NAME/ KEY: - 

(B) LOCATION: 1 . . 46 

(D) OTHER INFORMATION: /note* "hairpin primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567: 
CCAAAACCCC AAAACCCCCA CAGGGGTTTT GGGGTTTTGG GGTTTT 



(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 44 

(D) OTHER INFORMATION: /note= "hairpin primer" 



(xi)- SEQUENCE DESCRIPTION: SEQ ID NO:563: 
AAAACCCCAA AACCCCCACA GGGGTTTTGG GGTTTTGGGG TTTT- 



(2) INFORMATION FOR SEQ ID NO: 569: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : single 
■ (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



{!:<) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . .27 

(D) OTHER INFORMATION: /note= "degenerate oligonucleotide 

primer for cloning of Euplotes 
telomerase 123 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION': SEQ ID MO: 569: 
7C7RAARTAR TGDGTNADRT TRTTCAT 



(2) INFORMATION FOR SEQ ID NO: 570: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDMESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA - 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION : 1 . . 31 

(D) OTHER INFORMATION: /note- "degenerate oligonucleotide 

primer for cloning of Euplotes 
' teiorcerase 123 kDa protein subunit" 



(:■:!) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 
GCGGATCCAT GAAYCCWGAR AAYCCWAAYG T 
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(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 20 

(D) . OTHER INFORMATION :- /note= "degenerate oligonucleotide 

primer for cloning of Euplotes ■ 
teioraerase 43 kDa protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571: 

tvNNGTNACHG GHATHAAYAA 



(2) INFORMATION FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

' (ii) MOLECULE TYPE: DMA 



{!:<) FEATURE: 

(A) NAME/KEY: - 

(3} -LOCATION: 1. .21 

(0) OTHER INFORMATION: /note= "degenerate oligonucleotide 

primer for cloning of Euplotes 
telomerase 43 !<Da protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572: 
CGCDGT YTC Y TGRTCRTTRT A 



(2) INFORMATION FOR SEQ * ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 23 
(D) -OTHER INFORMATION: 



/note= "degenerate oligonucleot ids 
primer for cloning of Oxytricha 
trifallax telomerase homologous, to 
Euplotes 123 k.Da protein subunit" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: 
ACHAARG GHATYCCHYA RGG 



INFORMATION FOR SEQ ID MO: 57 4: 

(i) SEQUENCE CHARACTERISTICS: 
'{A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAlME/KEY: - 

(3) LOCATION: 1, . 23 

(D) OTHER INFORMATION: /note= "degenerate oligonucleotide 

primer for cloning of Oxytricha 
trifallax telomerase homologous to 
Euoloces 123 kDa protein sub-unit" 



(xi} SEQUENCE DESCRIPTION: SEQ ID NO: 57 4: 
DAT M ARM ARRTARTCRT C 



INFORMATION FOR SEQ ID NO : 5 7 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs. 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: I . .23 

(D) OTHER INFORMATION: /note* "K231 forward primer" 
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(ix) FEATURE : ■ 

(A) NAME/ KEY: modi f iedjoas e 

(B) LOCATION: 1 

'(D) OTHER INFORMATION: /mod_base- OTHER 

/note- "N - guanosine modified by 
biotin group" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 
ATTTYT TYTAYNNNAC NGA 



INFORMATION FOR SEQ ID NO: 576: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . .23 

(D) OTHER INFORMATION: /ao:e= "K220 reverse primer" 



(:d) SEQUENCE DESCRIPTION: SEQ ID NO: 576: 
GAT AT MA DMARRAARTC RTC 



INFORMATION 'FOR SEQ ID NO: 57 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(i:<) FEATURE: 

(A) NAME/ KEY : Modi f ied-s i te 
(3) LOCATION: 5 

(0) OTHER INFORMATION: /product- "OTHER" 

/note 3 "Xaa = ?he, He or Leu" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 577: 

Aso Aso Phe Leu Xaa He 
1 L ■ 5 
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(2) INFORMATION FOR SEQ ID MO: 573: 

(i) SEQUENCE CHARACTERISTICS: 

(A) "LENGTH : 23 base pairs 

(B) . TYPE: nucleic " acid 

(C) STRANDEDNESS : single 
• (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 23 

(D) OTHER INFORMATION: /note- "K228 forward primer" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO:578: 
ACAATGMGNH TNHTNCCNAA RAA 



(2-). INFORMATION FOR SEQ ID NO: 57 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
. (3) TYPE: amino acid 
(C) STRANDSDNESS : 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

( A ) N AM E / KE Y : Mod i f led- site 
(3) LOCATION: 2 . . 3 

(D) OTHER INFORMATION: / product = "OTHER" 

/note= "Kaa » Leu or lie" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID MO: 579: 

Arg Xaa Kaa Pro Lys Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 530: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDMESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(3} LOCATION: 1. .26 

(D) OTHER INFORMATION: /note- "K224 reverse primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:580: 
ACGAATCKMG GDATNSWRTC RTARCA 2 6 



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:581: 

Cys Tyr Asp Ser lie Pro Arg 
1 5 



(2) INFORMATION FOR SEQ ID NO: 58 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE- DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . .26 

(0} OTHER INFORMATION: /note= "K227 reverse primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:582:. 
CAA.TTCTCRT ARCANSWYTT DATRTC 2 6 



(2) INFORMATION FOR SEQ ID MO: 583; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:533: 



Asp He Lys Ser Cys Tyr Asp 
1 5 
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(2) INFORMATION FOR SEQ ID MO: 53 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..269 

(D) OTHER INFORMATION: /note= "clone 168-3'-' 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:584: 

GATTACTCCC GAAGAAAGGA TCTTTCCGTC CAATCATGAC TTTCTTAAGA AAGGACAAGC 60 

AAAAAAAT AT TAAGTTAAAT CTAAATTAAA TTCTAATGGA TAGCCAACTT GTGTTTAGGA 120 

ATTTAAAAGA CATGCT GGG A TAAAAG AT AG GATACTCAGT CTTTGATAAT AAACAAATTT 180 

C AG AAAAA T T TGCCTAATTC ATAGAGAAAT GGAAAAATAA AGGAAGACCT CAGCTATATT 24 0 

ATGTCACTCT AGACATAAAG ACTTGCTAC 2 69 



(2) INFORMATION FOR SEQ ID NO: 53 5: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 20 base pa-irs 
(3) TYPE: "nucleic acid 
(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



<i:<) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..23 

(D) OTHER INFORMATION: /note- "K297 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: 
GAGTGACATA ATATACGTGA 



(2) INFORMATION FOR SEQ ID NO: 53 6: 

(I) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 474 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . . 474 

(D) "OTHER INFORMATION: /note= "Tetrahymena DNA fragment" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586: 



AAACACAAGG 


AAGGAAGTCA 


AATATTCTAT 


TACCGTAAAC 


C AAT AT GG AA 


ATTAGTGAGT 


60 


AAATTAACTA 


TTGTCAAAGT 


AAGAATTTAG 


TTTTCTGAAA 


AGAATAAATA 


AATGAAAAAT 


120 


AATTTTTATC 


AAAAAATTTA 


GCTTGAAGAG 


GAGAATTTGG 


AAAAAGTTGA 


AGAAAAATTG 


18 0 


ATACCAGAAG 


ATTCATTTTA 


GAAATACCCT 


CAAGGAAAGC 


TAAGGATTAT 


AC C T AAAAAA 


240 


GGATCTTTCC 


GTCCAATCAT 


GACTTTCTTA 


AGAAAGGACA 


AGCAAAAAAA 


TATTAAGTTA 


300 


AATCTAAATT 


AAATTCTAAT 


GGATAGCCAA 


CTTGTGTTTA 


GGAATTTAAA 


AGACATGCTG 


360 


GGATAAAAGA 


TAGGATACTC 


AGTCTTTGAT 


AAT AAAC AAA 


TTTCAGAAAA 


ATTTGCCTAA 


420 


TTCATAGAGA 


AAT G G AAAAA 


TAAAGGAAGA 


CCTCAGCTAT 


ATTATGTCAC 


TCTA 


474 



(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 base pairs 
(3) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i:<) FEATURE: 

_( A ) NAME /KEY : - 
(3) LOCATION : L . . 5 0 

(D) OTHER INFORMATION: /c\ote= "Q-T primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 
CCAGTGAGCA GAGTGACGAG GACTCGAGCT CAAGCTTTTT TTTTTTTTTT 



(2) INFORMATION FOR SEQ ID NO: 58 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(8) TYPE: nucleic acid 

(C) STRAMDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 13 

(D) OTHER" INFORMATION: /note- "Q-0 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:588: 
CCAGTGAGCA GAGTGACG 18 



(2) INFORMATION FOR SEQ ID NO: 589: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic- acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cii) MOLECULE TYPE: DNA 



{ ixr ■ FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 32 

(0) OTHER INFORMATION: /note= "M2-T primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:589: 
GTGTCATTTC TATATGGAAG ATTTGATTGA TG ' 3 2 



(2) INFORMATION FOR SEQ ID NO: 590: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: -18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

{ 8 ) LOCATION: 1 . . 18 

(D) OTHER INFORMATION: /note= H Q-I primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 590: 



GAGGACTCGA GCTCAAGC 



18 
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(2) INFORMATION FOR SEQ ID NO: 5 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 32 

(D) OTHER INFORMATION: /note= "M2-T2 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591: 
ACCTATCGTT TACGAAAAAG AAAGGATCAG TG 



(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 32 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA - 



(i:<) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION:- 1 . . 32 

(D) OTHER INFORMATION: /note= "M2-B primer" 



(xi) SEQUENCE .DESCRIPTION: SEQ ID MO: 592 : 
CACTGATCCT TTCTTTTTCG TAAACGATAG GT 



(2) IN FORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 31 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
(D.) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . . 31 

(D) OTHER INFORMATION: /note= "M2-B2 primer 
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Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 593: 
CATCAATCAA ATCTTCCATA TAGAAATGAC A 31 



(2) INFORMATION FOR SEQ ID NO: 594: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 27 

(D) OTHER INFORMATION: /note- "PCR Adapt Sfil primer" 



(ix), FEATURE: 

(A) ' NAME/ KEY : modi £ iedjoase 

(B) LOCATION: 1 

(D) OTHER INFORMATION: /mod_b~ase = OTHER 

/note- M N = 5 * -phosphorylated guanosine" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594: 
NGGCCGTGTT GGCCTAGTTC TCTGCTC 27 



INFORMATION FOR SEQ" ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 38 

(D) OTHER INFORMATION: /note- " PCR Adapt Sfill primer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595: 
GAGGAGGAGA AGAGCAGAGA ACTAGGCCAA CACGCCCC 
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(2) INFORMATION FOR SEQ ID NO: 596: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..19 

(D) OTHER INFORMATION : /note- "TCP1.3 primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:596: 
ATGCTCCTGC GTTTGGTGG 



(2) INFORMATION FOR SEQ I D NO : 5 97 : 

(i) SEQUENCE CHARACTERISTICS.: 
(A) LENGTH: 26 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1..2S 

(D) OTHER INFORMATION: /note- "F3b primer" 



(xi)-' SEQUENCE DESCRIPTION: SEQ ID MO: 597: 
TCTAACCCTA ACTGAGAAGG GCGTAG 



(2) INFORMATION FOR SEQ ID NO: 593: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 26 

(D) OTHER INFORMATION: /note= ,T R3c primer" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598: 
GTTTGCTCTA GAATGAACGG TGGAAG 2 6 

(2) INFORMATION FOR SEQ ID NO: 599: 

(i) SEQUENCE CHARACTERISTICS: 

" (A) LENGTH: 170 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 17 0 

(D) OTHER INFORMATION: /note* "104-base intronic sequence 

(SEQ ID NO:7) inserted at the junction 
corresponding to bases 274 and 2.75 of 
hTRT (SEQ ID NO: 1) " 

(ix) FEATURE: 

(A) NAME/ KEY : intron 

(B) LOCATION: 28 . .131 

(D) OTHER INFORMATION: /note- "intronic sequence" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:599: 

CCCCCCGCCG CCCCCTCCTT CCGCCAGGTG GGCCTCCCCG GGGTCGGCGT CCGGCTGGGG ' 60 

7TGAGGGCGG CCGGGGGGAA CCAGCGACAT GCGGAGAGCA GCGCAGGCGA CTCAGGGCGC 120 

TTCCCCCGCA GGTGTCCTGC CTGAAGGAGC TGGTGGCCCG AGTGCTGCAG 17 0 

(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1285 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1 . . 1235 

(D) OTHER INFORMATION: /note= "fusion protein composed of 

enterokinase cleavable, His tagged 
thioradoxin moiety and full length hTRT" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600: 

Met Ser Asp Lys lie lie His Leu Thr Asp Asp Ser Phe Asp Thr Asp 
1 5 10 15 

Val Leu Lys Ala Asp Gly Ala lie Leu Val Asp Phe Trp Ala His Trp 
20 25 30 

Cys Gly Pro. Cys Lys Met lie Ala Pro lie Leu Asp Glu lie Ala Asp 
35 40 45 

Glu Tyr Gin Gly Lys Leu Thr Val Ala Lys Leu Arg lie Asp His Asn 
50 55 60 

Pro Gly Thr Ala Pro Lys Tyr Gly He Arg Gly He Pro Thr Leu Leu 
65 70 - 75 80 

Leu Phe Lys Asn Gly Glu Val Ala Ala Thr Lys Val Gly Ala Leu Ser 
85 90 95 

Lys Gly Gin Leu Lys Glu Phe Leu Asp Ala Asn Leu Ala Gly Ser Gly 
100 - 105 HO 

Ser Glv Asp Asp Asp Asp Lys Val Pro Met His Glu Leu Glu lie Phe 
115 . 120 _ 125" 

Glu Phe Ala Ala Ala Ser Thr Gin Arg Cys Val Leu Leu Arg Thr Trp 
130 135 140 . 

Glu Ala Leu Ala Pro Ala Thr Pro Ala Met Pro Arg Ala Pro Arg Cys 
145 150 ' 155 160 

Arg Ala Val Arg Ser Leu Leu Arg Ser His Tyr Arg Glu Val Leu Pro 
165 170 17 5 

. Leu Ala Thr Phe Val Arg Arg Leu Gly Pro Gin Gly Trp Arg Leu Val 
180 135 190 

Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala Leu Val Ala Gin Cys Leu 
_ 195 - 200 20 5 

Val Cvs Val Pro Trp Asp Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe 
210 " 215 220 . 

Arg Gin Val Ser Cys Leu Lys Glu Leu Val Ala Arg Val Leu Gin Arg 
225 230 235 240 

Leu Cys Glu Arg Gly Ala Lys Asn Val Leu Ala Phe Gly Phe Ala Leu 
245 250 255 

Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu Ala Phe Thr Thr Ser Val 
260 255 270 

Arg Ser Tyr Leu- Pro Asn Thr Val Thr Asp Ala Leu Arg Gly Ser Gly 
275 280 285 

Ala Trp Gly Leu Leu Leu Arg Arg Val Gly Asp Asp Val Leu Val His 
290 295 300 

Leu Leu Ala Arg Cys Ala Leu Phe Val Leu Val Ala Pro Ser Cys Ala 
305 . 310 315 320 
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Tyr Gin Val Cys 



Ala Arg Pro Pro 
340 

Glu Arg Ala Trp 
355 

Leu Pro Ala Pro 
370 

Leu Pro Leu Pro 
385 

Arg Thr Pro Val 



Gly Pro Ser Asp 
420 

Glu Glu Ala Thr 
4 3-5 

His Pro Ser Val 

■4 50 

Arg Pro Pro Arg 

4 6 5 

Thr Lys His Phe 



Ser Phe Leu Leu 
500 

Leu Val Glu Thr 
515 

Pro Arg Arg Leu 
530 

Leu Phe Leu Glu 
545 

Leu Leu Lys Thr 



Gly Val Cys Ala 
580 

Glu Glu Asp Thr 
595 

Ser Ser Pro Trp 
610 

Leu Val Pro Pro 
625 



Gly Pro Pro Leu 
325 

Pro His Ala Ser 



Asa His Ser Val 
360 

Gly Ala Arg Arg 
375 

Lys Arg Pro Arg 
390 

Gly Gin Gly Ser 
405 

Arg Gly Phe Cys 



Ser Leu Glu Gly 
440 

Gly Arg Gin His 
4 55 

Pro Trp Asp Thr 
470 

Leu Tyr Ser Ser 
485 

Ser Ser Leu Arg 



lie Phe Leu Gly 
520 

Pro Arg Leu Pro 
535 

Leu Leu Gly Asn 
550 

His Cys Pro Leu 
565 

Arg Glu Lys Pro 



Asp Pro Arg Arg 
600 

Gin Val Tyr Gly 
615 

Gly Leu Trp Gly 
630 



Tyr Gin Leu Gly 
330 

Gly Pro Arg Arg 
345 " 

Arg Glu Ala Gly 



Arg Gly Gly Ser 
330 

Arg Gly Ala Ala 
395 

Trp Ala His Pro 
4 10 

Val Val Ser Pro 
425 

Ala Leu Ser Gly 



His Ala Gly Pro 
460 

Pro Cys Pro Pro 
475 

Gly Asp Lys Glu 
490 

Pro Ser Leu Thr 
505 

Ser Arg Pro Trp 



Gin Arg Tyr Trp 
540 

His Ala Gin -Cys 
555 

Arg Ala Ala Val 
570 

Gin Gly Ser Val 
585 

Leu Val Gin Leu 



Phe Val Arg . Ala 
620 

Ser Arg His Asn 
635 



Ala Ala Thr Gin 
335 

Arg Leu Gly Cys 
350 

Val Pro Leu Gly 
365 

Ala Ser Arg Ser 



Pro Glu Pro Glu 
400 

Gly Arg Thr Arg 
415 

Ala Arg Pro Ala 
430 

Thr Arg His Ser 
4 45 

Pro Ser Thr Ser 



Val Tyr Ala Glu 
480 

Gin Leu Arg Pro 
495 

Gly Ala Arg Arg 
510 

i-'.ez Pro Gly Thr 
525 

Glr\ Met Arg Pro 



Pro Tyr Gly Val 
560 

Thr Pro Ala Ala 
575 

Ala Ala Pro Glu 
590 

Leu Arg Gin His 
605 

Cys Leu Arg Arg 



Glu Arg Arg Phe 
640 
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Leu Arg Asn Thr Lys Lys Phe lie Ser Leu Gly Lys His Ala Lys Leu 
645 650 655 

Ser Leu Gin Glu Leu Thr Trp Lys Met Ser Val Arg Asp Cys Ala Trp 
660 665 670 

Leu Arg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg 
675 680 685 

Leu Arg Glu Glu lie Leu Ala Lys Phe Leu His Trp Leu Met Ser Val 
690 695 700 

Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Vai Thr Glu Thr Thr 
705 710 715 720 

Phe Gin Lys Asn Arg Leu Phe Phe -Tyr Arg Lys Ser Val Trp Ser Lys 
725 - 730 735 

Leu Gin Ser lie Gly lie Arg Gin His Leu Lys Arg Val Gin Leu Arg 
740 745 750 

Glu Leu Ser Glu Ala Glu Val Arg Gin His Arg Glu Ala Arg Pro Ala 
755 760 765 

Leu Leu Thr Ser Arg Leu . Arg Phe^ He, Pro Lys Pro Asp Gly 'Leu Arg 
770 775 780 

Pro lie Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg 
785 790 795 300 

Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser 
805 310 315 

VaL Leu Asn Tvr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser 
320 . E 25 8 30 

Val Leu Gly Leu Asp Asp lie His Arg Ala Trp Arg Thr Phe Vai Leu 

3 35 - 8 4 0" 34 5 

Arg VaL Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val 
350 * 855 860 

Asp val Thr Gly Ala Tyr Asp Thr He Pro Gin Asp Arg Leu Thr Glu 
3 65 8 70 87 5 330 

Val He Ala Ser He lie Lys Pro Gin Asn Thr Tyr Cys Val Arg Arg 
885 ■ 890 895 

Tyr Ala Val Val Gin Lys Ala Ala His Gly His Val Arg Lys Ala Phe 
900 505 910 

Lys Ser His Val Ser Thr Leu Thr Asp Leu Gin Pro Tyr Met Arg Gin 
915 920 925 

Phe Vai Ala His Leu Gin Glu Thr Ser Pro Leu Arg Asp Ala Val Val 
930 935 940 

lie Glu Gin Ser Ser Ser Leu Asn Giu Ala Ser Ser Gly Leu Phe Asp 
945 " 950 955 560 



£1 
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Val Phe Leu Arg Phe Met Cys His His Ala Val Arg lie Arg Gly Lys 
965 970 975 

Ser Tyr Val Gin Cys Gin Gly lie Pro Gin Gly Ser lie Leu Ser Thr 
. 980 985 990 

Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala 
995 1000 1005 

Gly lie Arg Arg Asp Gly Leu Leu Leu Arg Leu Vai A_sp Asp Phe Leu 
1010 k 1015 1020 

Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu 
1025 1030 1035 1040 

Val Arg Gly Val Pro Glu Tyr Gly Cys Val Val Asn Leu Arg Lys Thr 
1045 1050 ' '1055 

Val Val Asn Phe Pro Val Glu Asp Glu Ala Leu Gly Gly Thr Ala Phe 
1060 1065 1070 

Val Gin Met Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu 
1075. 1080 1035 

Asp Thr Arg Thr Leu Glu Val Gin Ser Asp Tyr Ser Ser Tyr Ala Arg 
1090 1095 * 1100 

Thr Ser He Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly 
1105 1110 . 1115 1120 

Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His 
1125 1130 1135 



Ser Leu -Phe Leu Asp Leu Gin Val Asn Ser Leu Gin Thr Val Cys Thr 
1140 ' " 1145 1150 

Asn lie Tyr Lys lie Leu Leu Leu Gin Ala Tyr Arg Phe His Ala Cys 
1155 1160 1155 

Val Leu Gin Leu Pro Phe His Gin Gin Val Trp Lys Asn Pro Thr Phe 
1170 1175 1130 

Phe Leu Arg Val He Ser Asp Thr Ala Ser Leu Cys Tyr Ser He Leu 
1185 1190 " 1195 1200 

Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly 
1205 1210 1215 

Pro Leu Pro Ser Glu Ala Val Gin Trp Leu Cys His Gin Ala Phe Leu 
1220 1225 1230 



Leu Lys Leu Thr Arg His Arg Val- Thr Tyr Val Pro Leu Leu Gly Ser 
1235 1240 1245 

Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg Lys Leu Pro Gly Thr Thr 
1250 1255 1260 

Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe 

1265 1270 1275 1280 
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Lys Thr He Leu Asp 
1235 



(2) INFORMATION FOR SEQ ID NO : 601: 

(i) SEQUENCE CHARACTERISTICS:- 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:-601: 

Gly Ser Val Thr Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 602: 

(i). SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 538 amino acids 
(3) TYPE: amino acid ' 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME/KEY: Protein 
(3) LOCATION: 1 . - 533 
{ 0 ) OT H E R IN FO R.MAT I C N 



/ r.oze= "fusion protein composed of 
giutat'nione-S-transf erase sequence, . 
thro-bin cleavage sequence, recognition 
sequence for heart muscle protein 
kinase, residues introduced by cloning 
and hTRT protein fragment" 



xi) SEQUENCE DESCRIPTION: SFQ ID MO: 602: 



Met Ser Pro He Leu Gly Tyr Trp Lys lie Lys Gly Leu Val Gin Pro 
1 



5 ' " 10 15 



Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu 
20 25 30 

Tyr Glu Arg Asd Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu 
35 * 40 45 

Gly L-u Glu Phe Pro Asn Leu Pro Tyr Tyr lie Asp Gly Asp Val Lys 
5u 55 60 

Leu Thr Gin Ser Met Ala lie lie Arg Tyr lie Ala Asp Lys His Asn 
65 70 75 30 
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Met Leu Gly Gly Cys Pro Lys Glu Arg Ala Glu lie Ser Met Leu Glu 
85 90 95 

Gly Ala Val Leu Asp lie Arg Tyr Gly Val Ser Arg He Ala Tyr Ser 
100 105 - U0 

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu 
115 120 125 

Met Leu Lys Met Phe Glu Asp Arg Leu Cys His Lys Thr Tyr Leu Asn 
130 135 140 

Gly Aso His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp 
145 ^ 150 155 160 

Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu 
165 170 175 

Val Cys Phe Lys Lys Arg lie Glu Ala lie Pro Gin He Asp Lys Tyr 
180 185 190 

Leu Lys Ser Ser Lys Tyr He Ala Trp Pro Leu Gin Gly Trp "Gin Ala 
195 200 205 

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg 
210 215 ' 220 

Giv Ser Arg Arg Ala Ser Val Gly Ser Val Thr Lys lie Pro Gin Gly 
225 230 235 240 

Ser lie Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu 
245 250 255 

: sn Lvs r ^u Phe Ala Gly lie Arc Arg Asp Gly Leu Leu Leu Arg Leu 
260 " 265 270 

Vai" Asp Asp Phe Leu Leu Vai Thr Pro His Leu Thr His Ala Lys Thr 
275 230 235 

Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys Vai Val 
290 295 300 

Asn L^u A^g Lys Thr Val Vai Asn Phe Pro Val Glu. Asp Glu Ala Leu 
305 310 315 320 

Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe Pro Trp 
325 330 335 

Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser Asp Tyr 
340 345 350 

Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Val Thr Phe Asn Arg 
355 360 365 

Gly Phe Lvs Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly Val Leu 
370 ' 375 380 

A^g Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser Leu 
335 390 395 400 
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Gin Thr Val Cys Thr Asn lie Tyr Lys He Leu Leu Leu Gin Ala Tyr 
405 410 415 

Arg Phe His Ala Cys Val Leu^Gln Leu Pro Phe His Gin Gin Val Trp 
420 425 430 

Lys Asn Pro Thr Phe Phe Leu Arg Val He Ser Asp Thr Ala Ser Leu 
435 440 445 

Cys Tyr Ser He Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala 
450 455 * 460 

Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu Cys 
465 * 470 475 480 

His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val 
435 _ 490 495 

Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg Lys 
500 505 510 

Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu- Ala Ala Ala Asn Pro Ala 
515 520 525 

Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
5 20 * 5 35 



(2) INFORMATION FOR SEQ ED NO: 60 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 amino acid- 
(3) TYPE: amino acid 
( C) STPANDEDMESS : 
■*0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: orotein 



(ix) FEATURE: 

(A) NAME /KEY : Protein 
(3) LOCATION: 1 . . 530 

(D) OTHER INFORMATION: /note- "fusion protein composed of 

glutathione's- cransfe rase sequence, 
thrombin cleavage sequence, recognition 
sequence for heart muscle protein 
kinase, residues introduced by cloning, 
eight consecutive His residues and hTRT 
protein fragment" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:603: 

Met Ser Pro He Leu Gly Tyr Trp Lys He Lys Gly Leu Val Gin Pro 

1 5 10 15 

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu 
20 25 * 30 

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu 
35 40 45 































- 
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1 Gi y 


Leu 
50 


Glu 


Phe 


Pro 


Asn 


Leu 
55 


Pro 


Tyr 


Tyr 


He 


Asp 
60 


Giy 


Asp 


Val 


Lys 




n 

U Leu 
| 65 


Thr 


Gin 


Ser 


Met 


Ala 
70 


lie 


Tie 


Arg 


Tyr 


He 

75 " 


Ala 


Asp 


Lys 


His 


Asn 
80 




E 


Leu 


Giy 


Giy 


Cys 
85 


Pro 


Lys 


Glu 


Arg 


Ala 
90 


Glu 


He 


Ser 


Met 


Leu 
95 


Glu 




B 

| ■ Giy 


Ala 


Val 


Leu 
100 


Asp 


He 


Arg 


Tyr 


Giy 
105 


Val 


Ser 


Arg 


He 


Ala 
110 


Tyr 


Ser 




1 Lys 

s 


Asp 


Phe 
115 


Glu 


Thr 


Leu 


Lys 


Val 
120 


Asp 


Phe 


Leu 


Ser 


Lys 
125 


Leu 


Pro 


Glu 




1 Met 


Leu 
130 


Lys 


Met 


Phe 


Glu 


Asp 
135 


Arg 


Leu 


Cys 


His 


Lys 
140 


Thr 


Tyr 


Leu 


Asn 




Giy 
145 


Asp 


His 


Val 


Thr 


His 
150 


Pro 


Asp 


Phe 


Met 


Leu 
155 


Tyr 


Asp 


Ala 


Leu 


Asp 
160 




Val 


VaL 


Leu 


Tyr 


Met 
165 


Asp 


Pro 


Met 


Cys 


Leu 
170 


Asp 


Ala 


Phe 


Pro 


Lys 
175 


Leu 




"Val 


Cys 


Phe 


Lys 
180 


Lys 


Arg 


lie 


Glu 


Ala 
135 


lie 


Pro 


Gin 


lie 


Asp 
190 


Lys 


Tyr 




Leu 


Lys 


Ser 
195 


Ser 


Lys 


Tyr 


lie 


Ala 
200 


Trp 


Pro 


Leu 


Gin 


Giy 
205 


Trp 


Gin 


Ala 




Thr 


Phe 
210 


Giy 


Giy 


Giy 


Asp 


His 
215 


Pro 


Pro 


Lys 


Ser 


Asp 
220 


Leu 


Val 


Pro 


Arg 




Giy 
225 


Ser 


Arg 


Arg 


Ala 


Ser 
230 


Vai 


Giy 


Ser 


Vai 


His 
235 


His 


H i s 


His 


His 


His 
240 




His 


K i s 


Giy 


Ser 


Val 
245 


Thr 


Lys 


Me t- 


Ser 


Vai 
250 


Tyr 


Val 


Vs i 


Glu 


Leu 
255 


Leu 




Arg 


Ser 


Phe 


Phe 
260 


Tyr 


Vai 


Thr 


Giu 


Thr 
265 


Thr 


Phe 


Gin 


Lys 


Asn 
270 


Arg 


Leu 




Phe 


Phe 


Tyr 
275 


Arg 


Pro 


Ser 


Val 


Trp 
230 


Ser 


Lys 


Leu 


Gin 


Ser 
235 


lie 


Giy 


He 




Arg 


Gin 
290 


His 


Leu 


Lys 


Arg 


Val 
295 


Gin 


Leu 


Arg 


Glu 


Leu 
300 


Ser 


Glu 


Ala 


Glu 




Val. 
305 


Arg 


Gin 


His 


Arg 


Glu 
310 


Ala 


Arg 


Pro 


Ala 


Leu 
315 


Leu 


Thr 


Ser 


Arg 


Leu 
320 




Arg 


Phe 


lie 


Pro 


Lys 
325 


Pro 


Asp 


ui y 


Leu 


Arg 
330 


Pro 


lie 


Val 


Ash 


Met 
335 


Asp 




Tyr 


Val 


Val 


Giy 
340 


Ala 


Arg 


Thr 


Phe 


Arg 
345 


Arg 


Glu 


Lys 


Arg 


Ala 
350 


Glu 


Arg 




Leu 


Thr 


Ser 
355 


Arg 


Val 


Lys 


Ala 


Leu 
360 


Phe 


Ser 


Val 


Leu 


As n 
365 


Tyr 


Glu 


Arg 
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Ala A-q Arq Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp 
370 375 380 

He His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp 



385 



390 



395 



400 



Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr 
405 410 415 

Aso Th- lie Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He 
420 425 430 

Lys Pro Gin Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys 
43"5 4 40 4 45 

Ala Ala His Gly His Val Arg Lys Ala Phe. Lys Ser His Val Ser Thr 
450 455 . 460 

Leu Tr- As d Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin 
4 65 ^ 470 4 75 480 

Glu T-— Ser Pro Leu Arg Aso Ala Val Val lie Glu Gin Ser Ser Ser 
435 490 495 

Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met, 
500 305* 310 

Cys K-s His Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin 
515 520 525 



INfOP.XATIOM EOR SEQ ID MO:604: 

(1) SEQUENCE: CHARACTERISTICS: 

;A) LENGTH: 515 amino acids 
H ) TYPE: amino acid 
:Z) STRANDEQMESS : 
; 2) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



;ix} FEATURE: 

(A) NAME /KEY : Protein 
(3) LOCATION: 1 . .515 
(0) OTHER INFORMATION: 



/note- "fusion protein composed of 
glutathione-S-t rans f erase sequence, 
thrombin cleavage sequence, recognition 
sequence for heart muscle protein 
kinase, residues introduced by cloning 
and hTRT protein fragment" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO: 604: 



Met Ser Pro lie 'Leu Gly Tvr Trp Lys lie Lys Gly Leu Val Gin Pro 
1 5 10 15 

































----- - 
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||? T In r 


Arg 


Leu 


Leu 
20 


Leu 


Glu 


Tyr 


Leu 


Glu 
25 


Glu 


Lys 


Tyr 


Glu 


Glu 
30 


His 


Leu 


^ Tvr 


Glu 


Arg 
35 


Asp 


Glu 


Gly 


Asp 


Lys 
40 


Trp 


Arg 


Asn 


Lys 


Lys 
45 


Phe 


Glu 


Leu 


£ Gl y 

i 


Leu 
50 


Glu 


Phe 


Pro 


Asn 


Leu 
55 


■ Pro 


Tyr 


Tyr 


lie 


Asp 
60 


Gly 


Asp 


Val 


Lys 


H 

1 65 


Thr 


Gin 


Ser 


Met 


Ala 
70 


lie 


lie 


Arg 


Tyr 


He 
75 


Ala 


Asp 


Lys 


His 


Asn 
fi n 


1 ■ Met 

l 


Leu 


Gly 


Gly 


Cys 
85 


Pro 


Lys 


Glu 


Arg 


Ala 
90 


Glu 


lie 


~Ser 


Met 


Leu 
-/ -j 


Glu 


Gly 


Ala 


Val 


Leu 
100 


Asp 


lie 


Arg 


Tyr 


Gly 
105 


Val 


Ser 


Arg 


He 


Ala 
110 


Tyr 


Ser 


Lys 


Asp 


Phe 
115 


Glu 


Thr 


Leu 


Lys 


Val 
120 


Asp 


Phe 


Leu 


Ser 


Lys 
125 


Leu 


Pro 


Glu 


Met: 


Leu 
130 


L y s 


Met 


Phe 


Glu 


Asp 
135 


Arg 


Leu 


Cys 


Kis 


Lys 
14 0 


Thr 


Tyr 


Le u 


Asn 


G L y- 
14 5 


Asp 


His 


~Val 


Thr 


His 
150 


Pro 


Asp 


Phe 


Met 


Leu 
155 


Tyr 


Asp 


Ala 


Leu 


Asp 
160 


Va 1 


Val 


l e u 


Tyr 


Met 
165 


Asp 


Pro 


Me t 


Cys 


Leu 
170 


Asp 


Ala 


Phe 


Pro 


Lys 
17 5 


Leu 


VaL 


Cys 


Phe 


Lys 
180 


Lys 


Arg 


He 


Glu 


Ala 
135 


lie 


Pro 


Gin 


He 


Asp 
190 


Lys 


Tyr 


Leu 


Lys 


195 


Ser 


Lys 


Tyr 


lie 


Ala 
200 


T rp 


Pro 


Leu 


Gin 


Giy 
205 


Trp 


Gin 


Ala 


Thr 


210 


G 1 7 


Gly 


Gly 


Asp 


His 
215 


Pro 


Pro 


Lys 


Ser 


As p 

2 20 


Leu 


Val 


Pro 


Arg 


Giy 
225 


Ser 


Arg 


Arg 


Ala 


Ser 
230 


Val 


Gly 


Ser 


Val 


235 


Lys 


i-:e- 


See 


Val 


2 4 0 


Val 


Val 


Glu 


Leu 


Leu 
245 


Arg 


Ser 


Phe 


Phe 


Tyr 
250 


Val 


Thr 


Giu 


Thr 


Thr 
25 5 


Phe 


Gin 


Lys 


Asn 


Arg 
260 


Leu 


Phe 


Phe 


Tyr 


Arg 
265 


Pro 


Ser 


Val 


Trp 


Ser 
270 


Lys 


Leu 


Gin 


Ser 


lie 
275 


Giy 


lie 


Arg 


Gin 


His 
230 


Leu 


Lys 


A.rg 


Val 


Gin 
235 


Leu 


Arg 


Glu 


Leu 


Ser 
290 


Glu 


Ala 


Glu 


Val 


Arg 
295 


Gin 


His 


Arg 


Glu 


Ala 
300 


Arg 


Pro 


Ala 


Leu 


Leu 

.30 5 


Thr 


Ser 


Arg 


Leu 


Arg 
310 


Phe 


lie 


Pro 


Lys 


Pro 
315 


Asp 


Gly 


Leu 


Arc 


Pro 
320 


lie 


Val 


Asn 


Met 


Asp 
325 


Tyr 


Val 


Val 


Gly 


Ala 
330 


Arg 


Thr 


Phe 


Arg 


Arg 
335 


Glu 



£1 . 
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Lys Arg Ala Glu Arg Leu Thr Ser Arg Lys Ala Leu Phe Ser Val Leu 
340 345 350 

Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Val Leu 
355 360 365 

Gly Leu Asp Asp lie His Arg Ala Trp Arg Thr Phe Val Leu Arg Val 
370 375 380 

Arg Ala Gin Asp Pro Pro Pro Glu Tyr Phe Val Lys Val Asp Val Thr 
3 35 390 395 400 

Gly Ala Tyr Asp Thr lie Pro Gin Asp Arg Leu Thr Glu Val He Ala 
405 410 415 

Ser lie lie Lys Pro Gin Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val 
420 425 430 

Val Gin Lys Ala Ala His Gly Val Arg Lys Ala Phe Lys Ser His Val 
435 440 445 

Ser Thr Leu Thr Asp Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His 
450 455 460 

Leu Gin Glu Thr Ser Pro Leu Arg Asp Ala Val Val lie Glu Gin Ser 
465 _ 470 475 430 

Ser Ser Leu Asn Glu Ala Ser Gly Leu Phe Asp Val Phe Leu Arg Phe 
435 490 495 

Met Cys His His Ala Val Arg lie Arg Gly Lys Ser Tyr Val Gin Cys 
500 505 510 

Gin Gly lie 
515 



2} r;-:rORMAT:OM EOR SEQ ID N ! O:605: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 amino acids 
(3) TYPE: amino acid 
(C) STRANDEDNESS: 
(0) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME /KEY : Protein 
(3) LOCATION: 1 . . 515 

(D) OTHER INFORMATION: /note- "fusion protein composed of 

glutathione-S-transferase sequence, 
thrombin cleavage sequence, recognition 
sequence for heart muscle protein 
kinase, residues introduced by cloning 
and hTRT protein fragment" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

Met Ser Pro lie Leu Gly Tyr Trp Lys lie Lys Gly Leu Val Gin Pro 
1 5 10 15 

Thr Arg Leu Leu Leu Giu Tyr Leu Giu Giu Lys Tyr Giu Giu His Leu 
20 25 30 

Tyr Giu Arg Asp Giu Gly Asp Lys Trp Arg Asn Lys Lys Phe Giu Leu 
35 40 45 

Gly Leu Giu Phe Pro Asn Leu Pro Tyr Tyr lie Asp Gly Asp Val Lys 
50 55 60 

Leu Thr Gin Ser Met Ala lie lie Arg Tyr lie Ala Asp Lys His Asn 
65 70 75 80 

Met Leu Gly Gly Cys Pro Lys Giu Arg Ala Giu lie Ser Met Leu Giu 
85 90 95 

Giy Ala Val Leu Asp He Arg Tyr Gly Val Ser Arg He Ala Tyr Ser 
100 105 110, 

Lys As? Phe Giu Thr Leu Lys Val Asp Phe . Leu Ser Lys Leu Pro Giu 
- " 115 120 ^ 125 

Met Leu Lys Met Phe Giu Asp Arg Leu Cys Kis Lys Thr Tyr Leu Asn 
- 130 135 140 

Giy Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp 
145 " 150 * 155 160 

Val Val Leu Tyr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys Leu 
165 * 170 - 175 

Val Cys Phe Lys Lys Arg lie Giu Ala lie Pro Gin lie Asp Lys Tyr 
180 " 185. 190 

Leu Lys Ser Ser Lys Tyr lie Ala Trp Pro Leu Gin Gly Trp Gin Ala 
195 ^ 200 205 

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Vai Pro Arg 
210 * 215 220 

Gly Ser Arg Arg Ala Ser Val Gly Ser Val Thr Lys Ala Thr Ser Leu 
225 230 235 240 

Giu Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg 
245 250 255 

Gin His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp 
260 265 270 

Asp Thr Pro Cys Pro Pro Val Tyr Ala Giu Thr Lys Kis Phe Leu Tyr 
275 280 235 

Ser Ser Gly Asp Lys Giu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser 
290 " 295 300 

Leu Arg Pro Ser Leu Thr. Gly Ala Arg Arg Leu Val Giu Thr lie Phe 
305 310 315 320 
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Leu Gly Ser Arg 



Pro Trp Met 
325 



Pro Gly Thr 
330 



Pro Arg Arg 



Leu Pro Arg 
335 



Leu Pro Gin Arg 
340 

Gly Asn His Ala 
355 

Pro Leu Arg Ala 

370 



Tyr Trp Gin 
Gin Cys Pro 



Ala Val Thr 
.375 



Met Arg Pro 
345 

Tyr Gly Val 
360 

Pro Ala Ala 



Leu Phe Leu 



Leu Leu Lys 
365 

Gly Val Cys 
330 



Glu Leu Leu 
350 

Thr His Cys 
Ala Arg Glu 



Lys Pro Gin Gly 
335 



Ser Val Ala 
390 



Ala Pro Glu 



Glu Glu Asp 
395 



Thr Asp Pro 
400 



Arg. Arg Leu Val 



Gin Leu Leu 
405 



Arg Gin His 
- 4 10 



Ser Ser Pro 



Trp Gin Val 
415 



Tyr Gly Phe Val 
420 



Arg Ala .Cys 



Leu Arg Arg 
425 



Leu Val Pro 



Pro Gly Leu 
430 



Trp Gly Ser Arg 
435 



His Asn Glu 



Arg Arg Phe 
440 



Leu Arg Asn 
4 45 



Thr Lys Lys 



Phe 11-5 Ser Leu 
450 - 

Trp Lys Met Ser 
4 65 



Gly Lys His 
4 5 5 

Val Arg Asp 
470 



Ala Lys. Leu 
Cys Ala Trp 



Ser Leu Gin 
<!60 

Leu Arg Arg 
475 



Glu Leu Thr 



Ser Pro Gly 
430 



Val Gly Cys VaJ 



■ Pro Ala Ala 
485 



Glu His Arg 
4 90 



Leu Arg Glu 



Glu lie Leu 
■ 495 



Ala Lys Phe Leu 
500 



His T r o Leu 



Met Ser Val 
505 



Tvr Val Val 



Glu Leu Leu 
510 



ICIFORXATIOM cQR SEQ 10 NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 517 amino acids 
( 3 } TYPE : amino acid 

(C) STRAND EDM ESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(i:<) FEATURE: 

(A) NAME/ KEY : Protein 
(3) LOCATION: 1 . . 517 

(D) OTHER INFORMATION: /note 3 "fusion protein composed of 

glutathione-S-transferase sequence, 
thrombin cleavage sequence, recognition 
sequence for heart muscle protein 
kinase, residues introduced by cloning 
and hTRT protein fragment" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:606: 

Met Ser Pro lie Leu Gly Tyr Trp Lys lie Lys Gly Leu Val Gin, Pro 
1 5 -10 15 

Thr Arg Leu Leu Leu Glu Tyr Leu Glu Glu Lys Tyr Glu Glu His Leu 
20 25 30 

Tyr Glu Arg Asp Glu Gly Asp Lys Trp Arg Asn Lys Lys Phe Glu Leu 
35 40 45 

Gly Leu Glu Phe Pro Asn Leu Pro Tyr Tyr lie Asp Gly Asp Val Lys 
50 55 60 " 

Leu Thr Gin Ser Met Ala lie lie Arg Tyr lie Ala Asp Lys His Asn 
65 70 75 - 80 

Met Leu Gly Gly Cys .Pro Lys Glu "Arg Ala Glu lie Ser Met Leu Glu 
85 90 95 

Gly Ala Val Leu Asp He Arg Tyr Gly Val Ser Arg lie Ala Tyr Ser 
100 105 110 

Lys Asp Phe Glu Thr Leu Lys Val Asp Phe Leu Ser Lys Leu Pro Glu 
"l 15 " 120 125 

Met Leu Lys Met Phe Glu -Asp Arg Leu Cys His Lys Thr Tyr Leu Asn 
130 . * 135 140 

Gly Asp His Val Thr His Pro Asp Phe Met Leu Tyr Asp Ala Leu Asp 
145 150 4 155 160 



'yr Met Asp Pro Met Cys Leu Asp Ala Phe Pro Lys 



Va 1 Val Le , 

165 • 170 . 175 



Val Cys Phe Lys Lys Arg lie Glu Ala He Pro Gin lie Asp Lys Tyr 

180 185 _ 190 

Leu Lys Ser Ser Lys Tyr He Ala Trp Pro Leu Gin Gly Trp GIrr Ala 

195 " 200 * 2C5 

Thr Phe Gly Gly Gly Asp His Pro Pro Lys Ser Asp Leu Val Pro Arg 

■ 210 * 215 220 

Gly Ser Arg Arg Ala Ser Val Gly Ser Val Thr Lys Met Pro Arg Ala 

225 230 235 240 

Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Ser His Tyr Arg Glu Val 

245 250 255 

Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro Gin Gly Trp Arg 

260 265 270" 

Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala Leu Val Ala Gin 

275 * 280 235 

Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro Ala Ala Pro Ser 

290 295 300 

Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val Ala A.rg Val Leu Gin 

305 310 ' 315 320 
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Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu Ala Phe Gly Phe Ala 
325 330 335 

Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu Ala Thr Thr Ser Val 
340 345 350 

Arg Ser Tyr Leu Pro Asa Thr Val Thr Asp Ala Leu Arg Gly Ser Gly 
355 360 365 

Ala Trp Gly Leu Leu Leu Arg Arg Val Gly Asp Asp Val Leu Val His 
370 375 380 

Leu Leu Ala Arg Cys Ala Leu Phe Val Leu Val Ala Pro Cys Ala Tyr 
385 . 390 395 400 

Gin Val Cys Gly Pro Pro Leu Tyr Gin Leu Gly Ala Ala Thr Gin Ala 
405 410 415 

Arg Pro Pro Pro His Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu 
420 425 430 

Arg Ala Trp Asn His Ser Val Arg Glu Ala Gly Val Pro Leu Gly Leu 
4 3 5, 4 4 0 4 4 5 

Pro Ala Pro Gly- Ala Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu 
4 50 4 55 ' -460 

Pro Leu Pro Lys Arg Pro Arg Arg Gly Ala Ala Pro Glu Pro Glu Arg 
465 470 475 430 

Thr Pro Val Gly Gin Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly 
435 490 495 

Pro Ser Asp Arg Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu 
500 " 505 510 

Glu Ala Thr Ser Leu 
515 



INFORMATION FOR SEQ ID MO: "607: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE-: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(8) LOCATION: 1 . . 30 

CD) OTHER INFORMATION: /note* "oligonucleotide used to modify 

hTRT cDNA nucleotides 779 to 781 from 
GCG to CAT" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 
CCGGCCACCC CCCATATGCC GCGCGCTCCC 30 



(2) INFORMATION FOR SEQ ID NO: 608: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acici 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - ■ 

( B ) LOCATION: 1 . .58 

(0) OTHER INFORMATION: /note= "oi igonucieot ide used to create 

expression vectors pGRN127, pGRM128 and 
pGRNl 30 " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 608: 
TGCGCACGTG GGAAGCCCTG GC AG ATCTG A ATTCCACCAT GCCGCGCGCT CCCCGCTG 58 



(2) INFORMATION FOR SEQ ID NO; 609: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 60 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
(DJ TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 
(■B)- LOCATION: 1 . . 60 

(D) OTHER INFORMATION: /note- " ol igonuc leo c ide used to create 

expression vectors pGRML29 and pGRM130" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609: 
CGGGACGGGC TGCTCCTGCG TTTGGTGGAC GCGTTCTTGT TGGTGACACC TCACCTCACC 60 



(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610: 

As.n Ser Ala Vai Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



ix) FEATURE: 

(A) NAME /KEY : Protein 

(B) LOCATION: 1..1154" 

(D) OTHER INFORMATION: /note* "fusion protein composed of hTRT 

protein sequence, vector sequences, the 
Myc epitope and His6 tag" 



SEQUENCE DESCRIPTION: SEQ ID NO : 6 1 1 : 

'[■let Pro Arg Ala Pro Arg Cys Arg Aia Vai Arg Ser Leu Leu Arg Ser 

1 5 10 15 

His Tyr Arg Giu Vai Leu Pro Leu Aia Thr Phe Vai Arg Arg Leu Gly 
20 25 30 

Pro Gin Gly Trp Arg Leu Vai Gin Arg Gly Asp Pro Aia Ala Phe Arg 
3 5 * 40 . * 4 5 

Ala Leu Vai Ala Gin Cys Leu Vai Cys Vai Pro Trp Asp Ala Arc Pro 
50 55 60 

' Pro Pro Ala Ala Pro Ser ?he Arg Gin Vai- Ser Cys Leu Lys Glu Leu 
65 70 75 ■ SO 

Vai Aia Arg Vai Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Vai 
85 * 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro . Pro 
100 105 110 

Glu Ala Phe Thr Thr Ser Vai Arg Ser Tyr Leu Pro Asn Thr Vai Thr 
115 120 125 

Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Vai 
130 135 140 

Gly Asp Asp Vai Leu Vai His Leu Leu Ala Arg Cys Ala Leu Phe Vai 
145 * 150 155 160 



Leu Vai Ala Pro Ser Cys Ala Tyr Gin Vai Cys Gly Pro Pro Leu Tyr 
165 170 175 
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Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
180 185 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala- Trp Asn His Ser Val Arg 
195 -200 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 215 220 

Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg . 
225 230 235 240 

Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 

Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly' Phe Cys Val 
260 265 270 

Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 

290 ' 295- . 300 

Glv D ^o" Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 

305 ' 3.10 " 315 320 

Cys Pro -Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Aso Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie PT.e Leu -Gly Ser 
355 350 ~ 365 

--g D ro Tro Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pre Gin 
370 * 375 330 

i-q Tyr ^ro Gin Met -Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
335 ' 390 395 400 

Ala Gin'Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu .Asp Thr Asp Pro Arg Arg Leu 
435 ' 440 445 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
450 455 460 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
465 470 475 480 

Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
435 490 495 
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Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arg Asp Cys Ala Trp Leu Arg Arg-Ser Pro Gly Val Gly Cys 
515 520 525 

Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe 
530 535 540 ■ 

Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 550 555 ' 560 

Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asa Arg Leu Phe Phe Tyr 
565 570 575 

Arg Lys Ser Val Trp Ser Lys Leu Gin Ser lie Gly lie Arg Gin His 
580 585 ' 590 

Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe lie 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met Asp Tyr Val Val 
625 ' * 630 ' 635 ' 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 . 655 

Arc Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg- Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp lie His Arg 
675 * 630 685 

Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 " 695 700 

Glu leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 " 710 " k 715 720 

Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin 
725 730 735 

Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 
740 745 750 

Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 
755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 775 780 

Pro Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu 
785 790 795 800 



Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys ' *His His 
805 - 810 815 
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Ala Vai Arg lie Arg Gly Lys Ser Tyr Vai Gin Cys Gin Gly lie Pro 
820 825 830 

Gin Gly Ser lie Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
835 340 845 

Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gl.y Leu Leu Leu 
850 855 360 

Arg Leu Vai Asp Asp Phe Leu Leu Vai Thr Pro His Leu Thr His Ala 
865 870 875 880 

Lys Thr Phe Leu Arg Thr Leu Vai Arg Gly Vai Pro Glu Tyr Gly Cys 
885 890 895 

Vai Vai Asn Leu Arg Lys Thr Vai Vai Asn Phe Pro Vai Glu Asp Glu 
900 905 910 

Ala Leu Gly Gly Thr Aia Phe Vai Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Tro Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Vai Gin Ser 
930 935 9-40 

Asp Tvr Ser Ser Tyr Ala Arg Thr Ser He Arg Ala Ser Leu Thr Phe 
945 ' 950 - 955 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
" 965 970 975 

Vai Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Vai Asn 
930 935 990 

Ser Leu Gin Thr Vai Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin 
995 * 1000 1005 

■ Ala Tvr Arg Phe His Ala Cys Vai Leu Gin Leu Pro Phe His GLn Gin 
10 10 1015 10 20 

Vai Trp Lys Asn Pro Thr Phe Phe Leu Arg Vai lie Ser Asp Thr Aia 
1025 ' 1030 ' 1035 1040 

Ser Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 
1045 1050 1055 

Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Vai Gin Trp 
1060 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Vai Thr 
1075 1030 1085 

Tyr Vai Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 1100 

Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn 
1105 1110 1115 1120 

Pro Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp Leu Glu Gin Lys 
1125 1130 1135 
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Leu lie Ser Glu Glu Asp Leu Asn Ser Ala Val Asp. His His His His 
1140 1145 1150 

His His 



(2) INFORMATION FOR SEQ ID NO: 612: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

{A) NAME /KEY : Protein 
(B) LOCATION: 1 . . 1200 

(D) OTHER INFORMATION: /note= "fusion protein composed of His6 

and Anti-Xpress tags, enterokinase 
cleavage site and-full length hTRT 
protei n" 



;>;i) SEQUENCE DESCRIPTION: SEQ IDNO:612: 

Met Pro Arg Giy Ser His His His His His His Giy Met Ala Ser Met 
1 5 10 15 

Thr Giy Giy Gin Gin Met Giy Arg Asp Leu Tyr Asp Asp Asp Asp Leu 
20 " 25^ * 30' 

Asp Pro- Ser Ser Arg Ser Ala Ala Giy Thr Met Glu Phe Ala Ala Ala 
'25 4 0 4 5 

Ser Thr Gin Arg Cys Val Leu Leu Arg Thr Trp Glu Ala Leu Ala Pro 
50 / 55 60 

Ala Thr Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arc Ser 
65 70 75 80 

Leu Leu Arg Ser His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val 
85 90 95 

Arg Arg Leu Giy Pro Gin Giy Trp 'Arg Leu Val Gin Arg Giy Asp Pro 
100 105 110 

Ala Ala Phe Arg Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp 
115 120 125 

Asp Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys 
130 135 140 

Leu Lys Glu Leu Val Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Giy 
145 150 155 - " 160 



Ala Lys Asn Val Leu Ala Phe Giy Phe Ala Leu Leu Asp Giy Ala Arg 
165 170 175 
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Gly Gly Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro 
180 ' 185 190 

Asn Thr Val Thr Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu 
195 200 205 

Leu Arg Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys 
210 215 220 

Ala Leu Phe Val Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly 
225 230 235 240 

Pro Pro Leu Tyr Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro 
245 250 255 

His Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn 
260 265 270 

His Ser Val Arg Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly 
275 230 285 

Ala Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser.Leu'Pro Leu Pro Lys 
290 *■ 295 300 

Arg Pro Arg- Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly 
305 ' 310 315 320 

Gin Gly Ser Trp Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg 
325 330 335 

Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser 
34 0 34 5 350 

Leu Glu_ Gly Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly 

355 360 .365, 

Arg GIr. His His Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro 
370 375 360 

Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu 
335 * 390 * 395 * 400 

Tyr Ser Ser Gly Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser 
405 410 415 

Ser Leu Arg Pro Ser Leu Thr Gly Ala A.rg Arg Leu Val Glu Thr lie 
420 425 430 

Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro 
4 35 4 40 44 5 

Arg Leu Pro Gin Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu 
450 455 460 

Leu Gly Asn His Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His 
465 _ 470 475 480 

Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg 
485 490 495 
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Giu Lys Pro Gin Gly Ser Vai Ala Ala Pro Giu Giu Giu Asp Thr Asp 
500 505 510 

Pro Arg Arg Leu Vai Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin 
515 520 " 525 

Vai Tyr Gly Phe Vai Arg Ala Cys Leu Arg Arg Leu Vai Pro Pro Gly 
530 535 540 

Leu Trp Gly Ser Arg His Asn Giu Arg Arg Phe Leu Arg Asn Thr Lys 
545 550 555 560 

Lys Phe lie Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gin Giu Leu 
565 570 575 

Thr Tro Lys Met Ser Vai Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro 
580 585 '590 

Gly Vai Glv Cys Vai Pro Ala Ala Giu His Arg Leu Arg Giu Giu lie 
595 600 605 

Leu Ala Lys Phe Leu His Trp Leu Met Ser Vai Tyr Vai Vai Giu Leu 
610 - 615 620 

Leu Arg Ser Phe Phe Tyr Vai Thr Giu Thr Thr Phe Gin Lys Asn Arg 
625 " 630 ~ 635 640 

Leu Phe Phe Tyr Arg Lys Ser Vai Trp Ser Lys Leu Gin Ser He Gly 
645 650 655 

lie Arg Gin His Leu Lys Arg Vai Gin Leu Arg Giu Leu Ser Giu Ala 
660 665 670 

Giu Vai Arc Gin His Arg Giu Ala Arg Pro Ala Leu Leu Thr Ser Arg 
575 " 680 635 

Leu Arg Phe lie Pro Lys Pro Asp Gly Leu Arg Pro He Vai Asr. Met 
690 695 * _7O0 

Asp Tyr Vai Vai Gly Ala Arg Thr Phe Arg Arg Giu Lys Arg Ala Giu 
705 * 710 715 ' 720 

Arg Leu Thr Ser Arg Vai Lys Ala Leu Phe Ser Vai Leu Asn Tyr Giu 
725 730 735 

Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser Vai Leu Gly Leu Asp 
740 745 750 

Asp He His Arg Ala Trp Arg Thr Phe Vai Leu Arg Vai Arg Ala Gin 
755 760 765 

Asp Pro Pro Pro Giu Leu Tyr Phe Vai Lys Vai Asp Vai Thr Gly Ala 
770 775 780 

Tyr Asp Thr lie Pro Gin Asp Arg Leu Thr Gl.u Vai He Ala Ser He 
785 " 790 795 300 

He Lys Pro Gin Asn Thr Tyr Cys Vai Arg Arg Tyr Ala Vai Vai Gin 
805 310 815 
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Lys Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser 
320 825 330 

Thr Leu Thr Asp Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu 
335 " 840 845 

Gin Glu Thr Ser Pro Leu Arg Asp Ala Val Val He Glu Gin Ser Ser 
850 855 860 

Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe 
365 870 375 880 

Met Cys His His Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys 
885 890 895 

Gin Gly lie Pro Gin Gly Ser-Ile Leu Ser Thr Leu Leu Cys Ser Leu 
900 - 905 910 

Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp 
915 920 925 

Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His 
930 935 940 

Leu Thr His Ala Lvs Thr Phe. Leu Arg Thr. Leu Val Arg 'Gly Val Pro 
945 * 950 955 960. 

Glu Tvr Gly Cvs Val Val Asn Leu Arg Lys Thr Val Va.l Asn' Phe Pro 
965 970 975 

Val Glu Aso Glu Ala Leu Gly Gly thr Ala Phe Val Gin Met Pro Ala 
980 * 935 990 

His Glv Leu Phe Pro Tro Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu 
995 * 1000 1005 

Glu Val Gin Ser Asp Tyr Ser' Ser Tyr Ala Arg Thr Ser lie Arg Ala 
1010 _ 1015 1020 

Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg 
1025 1030 1035 1040 

Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp 
1045 1050 1055 

Leu Gin Val Asn Ser Leu Gin Thr Val Cys Thr Asn He Tyr Lys lie 
1060 1065 1070 

Leu Leu Leu Gin Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro 
1075 1030 1085 

■ Phe His Gin Gin Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val He 
1090 " 1095 HOO 

Ser Aso Thr Ala Ser Leu Cys Tyr Ser 'lie Leu Lys Ala Lys Asn Ala 
1105 ^ 1110 1115 1120 

Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu 
1125 1130 H35 
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Ala Val Gin Trp Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg 
1140 1145 1150 

His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin 
1155 1160 1165 

Thr Gin Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu 
1170 1175 1180 

Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1185 1190 1195 1200 



(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : _ 1189 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY linear 

(ii) MOLECULE TYPE: protein 



ix) FEATURE: 

(A) NAME /KEY : Protein 
(3) LOCATION: 1 . .118 9 

(D) OTHER INFORMATION: /note= "fusion protein composed of 

melittin signal sequence and full length 
hTRT orotein" 



SEQUENCE DESCRIPTION: SEQ ID NO: 613: 

Met Lvs Phe Leu Val Asn Val Ala Leu Val Phe Met Val Val Tyr lie 
1^ ~ 5 '10 15 

Ser Tyr lie Tyr Ala Aso Pro Ser Ser Arg Ser Ala Ala Gly Thr Met 
20 " 25 30 

Glu Phe Ala Ala Ala Ser Thr Gin Arg Cys Vai Leu Leu Arg Thr Trp 
35 40 45 

Glu Ala Leu Ala Pro Ala Thr Pro Ala Met Pro Arg Ala Pro Arr Cys 
50 * 55-60 

Arc f-la Vai Arg Ser Leu Leu Arc Ser His Tyr Arg Glu Val Leu Pro 
65' 70 75 80 

Leu Ala Thr Phe Val Arg Arg Leu Gly Pro Gin Gly Trp Arg Leu Val 
85 90 95 

Gin Arg Gly Asp Pro Ala Ala Phe * Arg Ala Leu Val Ala Gin Cys Leu 
100 10 5 110 

Val Cvs Val Pro Tro Aso Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe 
115 120 125 

Arg Gin Vai Ser Cys Leu Lys Glu Leu Val Ala Arg Val Leu Gin Arg 
130 135 140 
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Leu Cys Glu Arg Gly Ala Lys Asn Val Leu Ala Phe Gly Phe Ala Leu 
145 150 155 160 

Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu Ala Phe Thr Thr Ser Val 
165 170 175 

A.rg Ser Tyr Leu Pro Asn Thr Val Thr Asp Ala Leu Arg Gly Ser Gly 
180 185 190 

Ala Trp Gly Leu Leu Leu Arg Arg Val Gly Asp Asp Val Leu Val His 
195 200 205 

Leu Leu Ala Arg Cys Ala Leu Phe Val Leu Val Ala Pro Ser Cys Ala 
■ 210 215 220 

Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin Leu Gly Ala Ala Thr Gin 
225 230 235 240 

Ala Arg Pro Pro Pro His Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys 
245 250 255 

Glu Arg Ala Trp Asn His Ser Val Arg Glu Ala Gly Val Pro Leu Gly 
260 265 _ 270 

Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser 
275 280 - 235 

Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly Ala Ala Pro Glu Pro Glu 
290 295 300 

Arg Thr Pro Val Gly Gin Gly Ser Trp Ala His Pro Giy Arg Thr Arg 
305 310 315 320 

Glv Pro Ser Aso Aro Gly Phe Cys Val Val Ser Pro Ala Arg Pro Ala 
325 330 335 

GLu Glu. Ala Thr "Ser Leu Glu Giy Ala Leu Ser Giy Tr.r Arg His Ser 
340 345 350 

His Pro Ser Val Gly Arg Gin His His' Ala Gly Pro Pro Ser Thr Ser 
355 \ 360 365 

Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys Pro Pro Val Tyr Ala Glu 
' 370 " 375 380_ 

Thr Lys His Phe Leu Tyr Ser Ser Giy Asp Lys Glu Gin Leu Arg Pro 
385 390 395 400 

Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg 
405 410 415 

Leu Val Glu Thr He Phe Leu Gly Ser Arg Pro Trp Met Pro Gly Thr 
420 425 430 

Pro Aro Arg Leu Pro Arg Leu Pro Gin Arg Tyr Trp Gin Met Arg Pro 
4 35 " 4 40 4 45 



Leu Phe Leu Glu Leu Leu Gly Asn His Ala Gin Cys- Pro Tyr Gly Val 
450 455 460 
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Leu Leu Lys Thr His Cys Pro Leu Arg Ala Ala Val Thr Pro Ala Ala 
465 470 475 -80 

Gly Val Cys Ala Arg Glu Lys Pro Gin Gly Ser Val Ala Ala Pre Glu 

Giu Glu Asp Thr Asp Pro Arg Arg Leu Val Gin Leu Leu Arg Glr. His 
500 505 510 

Ser Ser Pro Trp Gin Val Tyr Gly Phe Val Arg Ala Cys Leu Arc Arg 
515 520 525 

Leu Val Pro Pro Glv Leu Trp Gly Ser Arg His Asn Glu Arg Arg Phe 
530 535 540 

Leu Arg Asn Thr Lys Lys Phe lie Ser Leu Gly Lys His Ala Lys Leu 
545 550 555 =60 

Ser Leu Gin Glu Leu Thr Trp Lys Met Ser" Val Arg Asp Cys Ala Trp 
565 570 57= 

Leu » rg Arg Ser Pro Gly Val Gly Cys Val Pro Ala Ala Glu His Arg 
580 585 - 590 

L»u arg Glu Glu He Leu Ala Lys Phe Leu His Trp Leu Met Ser" Val 
595 600 605 

Tyr Val Val Glu Leu Leu Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr 
610 615 620 

Phe Gin Lvs Asn Arg Leu Phe Phe Tyr Arg Lys Ser Val Trp Ser Lys 
625 '630 635 o40 

7„n r.in S »- Gly lie Arg Gin His Leu Lys Arg Val Gin Leu Arg 

" 64 5 650 65; 

Glu t-u Ser Giu Ala Glu Val Arg Gin His Arg Glu Ala Arg Pre Ala . 
660 665 670 

L-u Leu Thr Ser Arg "leu Arg Phe_ He Pro Lys Pro Asp Gly Leu Arg 
675 680 

Pro Tie Val Asn Met Asp Tyr Val Val Gly Ala Arg Thr Phe Arg Arg 
690 695 7 °0 

Glu Lvs Arg 'Ala Glu Arg Leu Thr Ser Arg Val Lys Ala Leu Phe Ser 
705 ■ ' "710 ' _ 7 15 720 

Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro Gly Leu Leu Gly Ala Ser 
725 7 30 7j = 

Val Leu Gly Leu Asp Asp He His Arg Ala Trp Arg Thr Phe Val Leu 
740 ' 7 45 75 0 

Pro Val Arg Ala Gin Asp Pro Pro Pro Glu Leu Tyr Phe Val Lys Val 
755 760 _ 7 55 

Asp Val Thr Gly Ala Tyr Asp Thr He Pro Gin Asp Arg Leu Thr Glu 
770 77 5 780 
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Val He Ala Ser lie He Lys Pro Gin Asn Thr Tyr Cys Val Arg Arg 
785 ' 790 ^95 _ 800 

Tyr Ala Val Val Gin Lys Ala Ala His Gly His Val Arg Lys Ala Phe . 

805 810 815 

Lys Ser His Val Ser Thr Leu Thr Asp Leu Gin Pro Tyr Met Arg Gin 
820 825 830 

Phe Val Ala His Leu Gin Glu Thr Ser Pro Leu Arg Asp Ala Val Val 
835 840 845 

He Glu Gin Ser Ser Ser Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp 
850 855 860 

Val Phe Leu Arg Phe Met Cys His His Ala Val Arg lie Arg Gly Lys 
865 870 875 880 

S-r Ty- Val Gin Cys Gin Gly He Pro Gin Gly Ser He Leu Ser Thr 
885 890 895 

Leu Leu Cys Ser Leu Cys Tyr Gly Asp Met Glu Asn Lys Leu Phe Ala 
. 900 505 . 910 

Gly Me Arg Arg Aso Gly Leu Leu Leu Arg Leu Val Asp Asp Phe Leu 
915 " 920 ■ 925 

Leu Val Thr Pro His Leu Thr His Ala Lys Thr Phe Leu Arg Thr Leu 
930 935 940 

Val Aro Gly Val Pro Glu Tyr Gly- Cys Val Val Asn Leu Arg Lys Thr 
945 * 950 955 960 

-al *sn =>h* Dt "o Val Glu Aso Glu Ala Leu Gly Gly Thr Ala Phe 

965 970 975 

Val- Gin Met "Pro Ala His Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu 
930 55 5 1 990 

isp Th _ Arg Thr Leu Giu val Gin Ser Asp Tyr Ser Ser Tyr Ala Arg 
995 1000 1005 

Thr Ser lie Arg Ala Ser Leu Thr Phe Asn Arg Gly Phe Lys Ala Gly 
1010 " 1°1 5 - 1020 

Arq Asn Met Ara Arg Lys Leu Phe Gly Val Leu Arg Leu Lys Cys His 
1025 * * ■ 1030 1035 1040 

Ser L*u Phe- Leu Asp Leu Gin Val Asn Ser Leu Gin Thr Val Cys Thr 
1045 1050 1055 

Asn Tyr Lys He Leu Leu Leu Gin Ala Tyr Arg Phe His Ala Cys 

1060 1065 1070 

Val Leu Gin Leu Pro Phe His Gin Gin Val Trp Lys Asn Pro Thr Phe 
1075 1080 1085 

Phe Leu Arg Val He Ser Asp Thr Ala Ser Leu Cys Tyr Ser He Leu 
1090 10S5 1100 
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Lys Ala Lys Asn Ala Gly Met Ser Leu Gly Ala Lys Gly Ala Ala Gly 
1105 mo 1H5 

Pro Leu Pro Ser Glu Ala Val Gin Trp Leu Cys His Gin Ala PheLeu 
1125 H30 113o 

Leu Lys Leu Thr Arg His Arg Val Thr Tyr Val Pro Leu Leu Gly Ser 
1140 H50 

Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg Lys Leu Pro Gly Thr Thr 
1155 H60 H 65 

Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe 
1170 H75 H 80 

Lys Thr lie Leu Asp 
1185 



INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 92 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS : single 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



;ix) FEATURE: 

(A) NAME /KEY: - 

' B ) LOCATION: 1.-92 . 

'D) OTHER INFORMATION: /note- "oligonucleotide used to create 
' ' excression vectors QGRN128 anc ?«oRN129 

SEQUENCE DESCRIPTION: SEQ 12 NO: 614: 



r, 60 

TGCGGCC GCCACCGCGG TGGAGCTCCA GC 



TCAG ACTTCAAGAC CAT C-C T G G AC _ T A C .--AG G AC G ACGATGACAA ATGAATTCAG 

92 



INFORMATION FOR SEQ ID NO: 615: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 53 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA ' 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 53 

( D ) OTHER INFORMATION: /no:e* "mutagenic oligo used toadda 

CSP45I site at the C-termxnus o: hTRl 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615: 
CTTCAAGACC ATCCTGGACT TTCGAAACGC GGCCGCCACC GCGGTGGAGC TCC 



(2) INFORMATION FOR SEQ ID NO: 616: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: . 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 33 

(D) OTHER INFORMATION: /note= "oligonucleotide RA4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616: 
GCCACCCCCG CGCTGCCTCG AGCTCCCCGC TGC 



(2) INFORMATION FOR SEQ ID MO: 617: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 8 base pairs 
(3) TYPE: nucleic acid 
;C) STRANDEDNESS: single 
( D ) TOPOLOGY: linear 

(li) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1.-48 

(D) OTHER INFORMATION: /note- "primer hTRrl" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617: 
GGGGAAGCTT TAATACGACT CACTATAGGG TTGCGGAGGG TGGGCCTG 



(2) INFORMATION FOR SEQ ID NO: 618: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 36 

(D) OTHER INFORMATION: /note- "primer hTR+44 5" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618: 
CCCCGGATCC TGCGCATGTG TGAGCCGAGT CCTGGG 



(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 base pairs 

(B) TYPE: nucleic acid 
_(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 62 

(D) OTHER INFORMATION: /note- "synthetic telomerase product" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619: 
A7TCCGTCGA GCAGAGTTAG GGTTAGGGTT AGGGTTAGGG ■ TTAGGGTTAG GGTTAGGGTT 60 

62 



INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
(3) TYPE: nucleic ac-id 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . .18 

(D) OTHER INFORMATION: /note- "primer M2" 



(xi) SEQUENCE DESCRIPTION: 
AATCCGTCGA GCAGAGTT 



SEQ ID NO:620: 



18 
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(2) INFORMATION FOR SEQ ID NO: 621: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION : 1 . . 18 

(D) OTHER INFORMATION: /note- "primer H3.03" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621: 
TTAGGGTTAG GGTTAGGG 



(2) INFORMATION 'FOR SEQ ID NO:622: 

(i) -SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

Lxi) SEQUENCE DESCRIPTION: SEQ ID NO:622: 

18 

AAICCGTCGA GCAGAGGG - - 

(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: 

18 

AATCCGTCGA GC AG A TAG 

(2) INFORMATION FOR SEQ ID NO: 62 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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{ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..84 

(D) OTHER INFORMATION: /note- "T701 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 624 : 
GGGAGATCTT AATACGACTC ACTATAGATT CAGGCCATGG TGCTGCGCCG GCTGTCAGGC 
TCCCACGACG TAGTCCATGT TCAC 

(2) INFORMATION FOR SEQ ID NO: 625: 

(i) SEQUENCE CHARACTERISTICS: " 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . .'32 

(D) OTHER INFORMATION: /note= "reverseOl primei 



(xi") SEQUENCE DESCRIPTION: SEQ ID NO: 625: 
'CTAGAT CCGGAAGAGT GTCTGGAGC A AG 



(2) INFORMATION FOR SEQ ID NO: 62 6: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84 base pairs 
(3) TYPE: nucleic acid 
(■C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . .84 

(D) OTHER INFORMATION: /note- "T702 primer '* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:626: 
GGGAGATCTT AATACGACTC ACTATAGATT CAGGCCATGG TGCTGCGCCG GCTGTCAGGG 
CGGCCTTCTG GACCACGGCA TACC 



362 



(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. . 30 

(D) OTHER INFORMATION: /note- "reverse02 primer 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627: 
GGTCTAGACG ATATCCACAG GGCCTGGCGC 



(2) INFORMATION FOR SEQ ID NO: 628: 

(if SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1407 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(ix) FEATURE: 

(A) NAME /KEY : Protein 

(3) LOCATION: 1 . . 1 4 0 7 _ 

( D) " OTHER INFORMATION: /note- "fusion protein composed of -; 

enhanced green fluorescent protein 
LEGE?) residues, residues encoded by the 
5' untranslated region of hTRT rr\RNA and 
hTRT orptein sequence" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623: - 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 "5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie 
35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
-50 55 60 

Leu Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 
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Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gl-n Glu 
85 90 95 

Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly 
115 120 125 

lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys Leu Glu Tyr 
130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

Gly lie Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 1*70 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 - 200 205 

Ser Lys Asd Pro Asn Glu Lvs Arg Asp His Met Val Leu Leu Glu Phe 
210 ^ 215 220 

Val Thr Ala Ala Gly He Thr Leu "Gly Met Asp Glu Leu Tyr Lys Ser 
225 230 235 240 

Gly r-a Thr Gin Tie Ser Ser Ser Ser Phe Glu Phe Ala Ala Ala Ser 
245 250 255 

Thr Gin Arg Cvs Val Leu Leu Arc Thr Trp Glu Ala Leu Ala Pro Ala 
260 265 270 

Thr Pro Ala Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu : 
27 5 230 2 85 

Leu A:: Ser His Tvr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg 
290 " 295 300 

Arg L~u Gly Pro Gin Gly Tro Arc .Leu Val Gin Arg Gly Asp Pro Ala 
305 310 " 315 320 

Ala Phe Arg Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp 
325 330 335 

Ala Arg Pro Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu 
340 34 5 • 350 

Lys Glu Leu Val Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala 
355 360 365 

Lys Asn Val Leu Ala Phe Glv Phe Ala Leu Leu Asp Gly Ala Arg Gly 
370 375 330 

Glv Pro Pro Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn 
385 390 395 400 
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Thr Val Thr Asp Ala Leu Arc Gly Ser Gly Ala Trp Gly Leu Leu Leu 
405 410 415 

Arg Arg Val Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys.Ala 
420 425 430 

Leu Phe Val Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro 
435 440 445 

Pro Leu Tyr Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His 
450 455 460 

Ala Ser Gly Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His 
465 470 475 480 

Ser Val Arg Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala 
485 490 495 

Arg Arg Arg Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg 
500 505 510 

o ro pr-g Arg Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin 
515 520 . 525 

Gly Ser Tro Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arc Gly 
530 * 535 - 540 

Phe Cys Val Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu 
545 550 555 560 

Glu Glv Ala Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg 
565 570 575 

Gin His h'< s Ala Glv Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp 
530 * 535 5.90 

z S o Thr" Pro Cys Pro Pro Val Tyr "Ala Glu Thr Lys His Phe Leu Tyr 

595 SOO tC5 

S^r Ser Gly Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser 
610 615 620 

Leu Arg Pro Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe 
625 630 _ 635 640 

Leu Gly Ser Arg Pro Tro Met Pro" Gly Thr Pro Arg Arg Leu Pro Arg 
645 650 655 

Leu Pro Gin Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu 
660 665 670 

Gly Asn His Ala Gin Cvs Pro Tyr Gly Val Leu Leu Lys Thr His Cys 
675 680 635 

Pro Leu Arg Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu 
690 695 700 

Lys P-o Gin Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro 
705 710 715 720 
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Arg Arg Leu Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val 
725 730 735 

Tyr Gly Phe Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu 
740 745 750 

Trp Gly Ser Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys 
755 760 765 

Phe lie Ser Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr 
770 775 780 

Trp Lys Met Ser Val Arg Asp Cys Ala Trp Leu Arg Arg Ser Pro Gly 
785 790 795 800 

Val Gly Cys Val Pro Ala Ala Glu His Arg Leu. Arg Glu Glu lie Leu 
805 810 815 

Ala Lys Phe Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu 
820 825 830 

Arg Ser Phe Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu 
835 " 840 845 

Phe Phe Tyr Arg Pro Ser Val Trp Ser Lys Leu Gin Ser He Gly He 
350 " 855 860 

Arg Gin His Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu 
865 870 875 380 

Val Arg Gin His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu 
885 890 895 

Aro Phe He Pro Lys Pro Asp Gly Leu Arg Pro lie Val Asn Met: Asp. 
900 * . 905 910 

Tvr Val Val Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg 
915 " 920 525 

Leu Thr Ser Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg 
930 935 9^0 

Ala Arg Aro Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp 
945 ' 950 955 960 

He His Arg Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp 
965 970 975 

Pro Pro Pro Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr 
980 985 990 

Asp Thr lie Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser lie He 
995 * 1000 ■ 1005 

Lys Pro Gin Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys 
1010 1015 _ 1020 

Ala Ala His Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr 
1025 1030 1035 1040 
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Leu Thr Asp . Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin 
1045 1050 1055 

Glu Thr Ser Pro Leu Arg Asp Ala Val Val lie Glu Gin Ser Ser Ser 
1060 1065 1070 

Leu Asn Glu Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met 
1075 1080 1085 

Cys His His Ala Val Arg lie Arg Gly Lys Ser Tyr Val Gin Cys Gin 
1090 1095 1100 

Gly lie Pro Gin Gly Ser lie Leu Ser Thr Leu Leu Cys Ser Leu Cys 
1105 1110 1115 1120 



Tyr Gly Asp Met Glu Asn Lys Leu 
1125 

Leu Leu Leu Arg Leu Val Asp Asp 
1140 

Thr His Ala Lys Thr Phe Leu Arg 
1155 1160 

Tyr Gly Cys Val Val Asn Leu Arg 

1170 1175 " 

Glu Asp Glu Ala Leu Gly Gly Thr 
1185 * 1190 



Phe Ala Gly lie Arg Arg Asp Gly 
1130 1135 

Phe Leu Leu Val Thr Pro His Leu 
1145 1150 

Thr Leu Val Arg Gly Val Pro Glu 
1165 

Lys Thr Val Val Asn Phe Pro Val 
1180 

Ala Phe Val Gin Met Pro Ala His 
1195 1200 



Gly Leu Phe Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu 
1205 * 1210 1215 

Val Gin Ser Asp Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser 
1220 * 1225 1230 

Val Thr Phe Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arc Lys 
123 5 12 4 0 12 4 5 

Leu Phe Gly Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu 

1250 1255 1260 

Gin Val Asn Ser Leu Gin Thr Val. Cys Thr Asn lie Tyr Lys lie Leu 
1265 1270 - 1275 1280 

Leu Leu Gin Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe 
1285 1290 1295 

His Gin Gin Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser 
1300 1305 1310 



Asp Thr Ala Ser Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly 
1315 1320 ' 1325 

Met Ser Leu Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala 
1330 1335 1340 

Val Gin Trp Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His 

1345 1350 1355 1360 



Arg Vai Thr Tyr Vai Pro Leu Leu Gly Ser Leu Arg Thr" Ala Gin Thr 
1365 1370 1375 



Gin Leu Ser Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala 
1380 1385 1390 

Ala Ala Asn Pro Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1395 1400 1405 



(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY f - 

(B) LOCATION: 1 . . 18 

{.D) ■ OTHER INFORMATION: /note= "TS primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 629: , 
AATCCGTCGA GCAGAGTT 



■12) INFORMATION FOR SEQ ID MO: 630: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
(3) TYPE: nucleic acid 

(C) S T RAN D E DN E S S : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: - 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note= "ACX primer" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630: 
GCGCGGCTTA CCCTTACCCT TACCCTAACC 



(2) INFORMATION FOR SEQ ID NO: 631: 

(i) SEQUENCE CHARACTERISTICS: 
.(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
'(D) TOPOLOGY: linear 



368 



(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. . 18 

(D) OTHER INFORMATION: /note- "U2 primer" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631: 
ATCGCTTCTC GGCCTTTT 18 

(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 36 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 36 

(D) OTHER INFORMATION: /note= "TSU2 primer' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632: 
AA7CCGTCGA GCAGAGTTAA AAGGCCGAGA- AGCGAT 3 6 

(2} INFORMATION FOR SEQ ID NO: 63 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 6 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i te 
{ 3) LOCATION.: 1 

(D) OTHER INFORMATION: /product* "OTHER" 

/note= "Xaa « hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, 
Trp or Met" 
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(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si t e 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note^ "Xaa = hydrophobic amino acid, 
Ala, Leu, He, Val, Pro, Phe, 
■ Trp or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 

Xaa Arg Xaa Xaa Pro Lys 
1 5 



INFORMATION FOR SEQ ID NO: 634 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH:" 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-si te 
(3) LOCATION: 1 

(D) OTHER INFORMATION: /product- "OTHER" 

/note- "Xaa - Phe or Leu" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied-si te 

(3) LOCATION: 3 - 

!D) OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa hydrophobic amino, acid, 
Ala , Leu , lie, Val, Pro, Phe , 
Trp or Met" 

(ix) FEATURE: 

(A) NAME /KEY : Modif ied-site 
{3} LOCATION: 7 

(D) OTHER INFORMATION: /product* "OTHER" 

/note- "Xaa « hydrophobic amino acid, 
■ Ala, Leu, He, Val, Pro, Phe, 
Tro or Me t " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634: 



Xaa Arg Xaa He Xaa Xaa Xaa 
1 ■ 5 
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(2) INFORMATION FOR SEQ ID NO: 635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i te 

(B) LOCATION: 6 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, 
Trp or Met" 

(ix) FEATURE-: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 8 

(D) OTHER INFORMATION: /produ-ct- "OTHER" 

/note- "Xaa - hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro., Phe, 
Trp or Met" 

(ix) FEATURE: 

(A) NAME / KEY : Modi f ied-s i te 
(3) LOCATION: 10 

(D) OTHER INFORMATION: /product= "OTHER" 

/note- "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, 
Trp or Met" 



>:i) SEQUENCE DESCRIPTION: SEQ 10 NO: 635: 

Pro Xaa Leu Tyr Phe. Xaa Xaa Xaa Asp Xaa Xaa Xaa Cys Tyr Asp Xaa 
1 5 10 15 

He 



(2) INFORMATION FOR SEQ ID MO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i t e 

(B) LOCATION: 18 

(D) OTHER INFORMATION: /product- "OTHER " 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, 
Trp or Met" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

Lys Xaa Tyr Xaa Gin Xaa Xaa Gly lie Pro Gin Gly Ser Xaa Leu Sex 
1 5 10 15 



Xaa Xaa Leu 



(2) INFORMATION FOR_ SEQ ID NO: 637: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



FEATURE: 
(A) NAME /KEY : Modi f i ed- S i t e 
(3) LOCATION: 10 

(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa = hydrophobic amino acid, 
Ala, Leu, lie, Val, Pro, Phe, 
Trp or Met/' 



;:•:!) .SEQUENCE DESCRIPTION: SEQ ID MO: 637: 

Leu Leu Arg Leu Xaa Asp Asp Xaa Leu Xaa lie Thr 
1 5 10 



2) INFORMATION FOR SEQ ID NO: 638: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3396 base' pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

( B) LOCATION : 1 . . 3396 - , . _ 

(D) OTHER INFORMATION: /note* "hTRT-encoding sequence employing 

alternative codon distributions for 
E. coli (all genes) " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

ATGCCGCGCG CGCCGCGCTG CCGCGCGGTG CGCAGCCTGC TGCGCAGCCA TTATCGCGAA 60 

GTGCTGCCGC TGGCGACCTT TGTGCGCCGC CTGGGCCCGC AGGGCTGGCG CCTGGTGCAG 120 

CGCGGCGATC CGGCGGCGTT TCGCGCGCTG GTGGCGCAGT GCCTGGTGTG CGTGCCGTGG 180 

GATGCGCGCC CGCCGCCGGC GGCGCCGAGC TTTCGCCAGG TGAGCTGCCT GAAAGAACTG 24 0 

GTGGCGCGCG TGCTGCAGCG CCTGTGCGAA CGCGGCGCGA AAAACGTGCT GGCGTTTGGC 300 

TTTGCGCTGC TGGATGGCGC GCGCGGCGGC CCGCCGGAAG CGTTTACCAC CAGCGTGCGC 360 

AGCTATCTGC CGAACACCGT GACCGATGCG CTGCGCGGCA GCGGCGCGTG GGGCCTGCTG 4 20 

CTGCGCCGCG TGGGCGATGA TGTGCTGGTG CATCTGCTGG CGCGCTGCGC GCTGTTTGTG 4 80 

CTGGTGGCGC CGAGCTGCGC GTATCAGGTG TGCGGCCCGC CGCTGTATCA GCTGGGCGCG 54 0 

GCGACCCAGG CGCGCCCGCC GCCGCATGCG AGCGGCCCGC - GCCGCCGCCT GGGCTGCGAA 600 

CGCGCGTGGA ACCATAGCGT GCGCGAAGCG GGCGTGCCGC TGGGCCTGCC GGCGCCGGGC 660 

GCGCGCCGCC GCGGCGGCAG CGCGAGCCGC AGCCTGCCGC T GC CGAAACG CCCGCGCCGC 720 

GGCGCGGCGC CGGAACCGGA ACGCACCCCG GTGGGCCAGG GCAGCTGGGC GCATCCGGGC 7 80 

CGCACCCGCG GCCCGAGCGA TCGCGGCTTT TGCGTGGTGA GCCCGGCGCG CCCGGCGGAA 84 0 

GAAGCGACCA GCCTGGAAGG CGCGCTGAGC GGCACCCGCC ATAGCCATCC GAGCGTGGGC 900 

CGCCAGCATC ATGCGGGCCC GCCGAGCACC AGCGGCCCGC CGCGCCCGTG GGATACCCCG 9 60 

TGCCCGCCGG TGTATGCGGA AA C C .AAA CAT TTTCTGTATA GCAGCGGCGA TAAAGAACAG 1020 

CTGCGCCCGA GCTTTCTGCT GAGCAGCCTG CGCCCGAGCC 7GACCGGCGC GCGCCGCCTG -1080 

GTGGAAACCA TTTTTCTGGG CAGCCGCCCG TGGATGCCGG GCACCCCGCG CCGCCTGCCG 1140 

CGCCTGCCGC AGCGCTATTG GCAGATGCGC CCGCTGTTTC TGGAACTGCT GGGCAACCAT 1200 

GCGCAGTGC'C ■ CGTATGGCGT GCTGCTGAAA ACCCATTGCC CGCTGCGCGC GGCGGTGACC 1260 

CCGGCGGCGG GCGTGTGCGC GCGCGAAAAA CCGCAGGGCA GCGTGGCGGC GCCGGAAGAA 1320 

GAAGATACCG ATCCGCGCCG CCTGGTGCAG CTGCTGCGCC AGCATAGCAG CCCGTGGCAG 1380 

GTGTATGGCT TTGTGCGCGC GTGCCTGCGC CGCCTGGTGC CGCCGGGCCT GTGGGGCAGC 14 4 0 

CGCCATAACG AACGCCGCTT TCTGCGCAAC A C C AAAAAA T TTATTAGCCT GGGCAAACAT 1500 

GCGAAACTGA GCCTGCAGGA ACTGACCTGG AAAATGAGCG TGCGCGATTG CGCGTGGC7G 15 60 

CGCCGCAGCC CGGGCGTGGG CTGCGTGCCG GCGGCGGAAC ATCGCCTGCG CGAAGAAA7T 1620 

CTGGCGAAAT TTCTGCATTG GCTGATGAGC GTGTATGTGG TGGAACTGCT GCGCAGCTTT 168 0 

TTTTATGTGA CCGAAACCAC CTTTCAGAAA AACCGCCTGT TTTTTTATCG CAAAAGCGTG 17 4 0 

TGGAGCAAAC TGCAGAGCAT TGGCATTCGC CAGCATCTGA AACGCGTGCA GCTGCGCGKh 18 00 









3TT~ 








g| CTGAGCGAAG 


CGGAAGTGCG 


CCAGCATCGC 


GAAGCGCGCC 


CGGCGCTGCT 


GACCAGCCGC 


1860. 


E CTGCGCTTTA 


TTCCGAAACC 


GGATGGCCTG 


CGCCCGATTG 


TGAACATGGA 


TTATGTGGTG 


1920 


1 GGCGCGCGCA 


CCTTTCGCCG 


CGAAAAACGC 


GCGGAACGCC 


TGACCAGCCG 


CGTGAAAGCG 


1980 


E CTGTTTAGCG 


TGCTGAACTA 


TGAACGCGCG 


CGCCGCCCGG 


GCCTGCTGGG 


CGCGAGCGTG 


2040 


| CTGGGCCTGG 


ATGATATTCA 


TCGCGCGTGG 


CGCACCTTTG 


TGCTGCGCGT 


GCGCGCGCAG 


2100 


H GATCCGCCGC 


CGGAACTGTA 


TTTTGTGAAA 


GTGGATGTGA 


CCGGCGCGTA 


TGATACCATT 


2160 


| CCGCAGGATC 


GCCTGACCGA 


AGTGATTGCG 


AGCATTATTA 


AACCGCAGAA 


CACCTATTGC 


2220 


| GTGCGCCGCT 


ATGCGGTGGT 


GCAGAAAGCG 


GCGCATGGCC 


ATGTGCGCAA 


AGCGTTTAAA 


2280 


11 AGCCATGTGA 


GCACCCTGAC 


CGATCTGCAG 


CCGTATATGC 


GCCAGTTTGT 


GGCGCATCTG 


2340 


B CAGGAAACCA 


GCCCGCTGCG 


CGATGCGGTG 


GTGATTGAAC 


AGAGCAGCAG 


CCTGAACGAA 


2400 


1 . GCGAGCAGCG 


GCCTGTTTGA 


TGTGTTTCTG 


CGCTTTATGT 


GCCATGATGC 


GGTGCGCATT 


2460 


I . CGCGGCAAAA. 


GCTATGTGCA 


GTGCCAGGGC 


ATTCCGCAGG 


GCAGCATTCT 


GAGCACCCT.G 


2520 


1 CTGTGCAGCC 


TGTGCTATGG 


CGATATGGAA 


AACAAACTGT 


TTGCGGGCAT 


TCGCCGCGAT 


2580 


GGCCTGCTGC 


TGCGCCTGGT 


GGATGATTTT- 


CTGCTGGTGA 


CCCCGCATCT 


GACCCATGCG 


2640 


AAAACCTTTC 


TGCGCACCCT 


GGTGCGCGGC 


GTGCCGGAAT 


ATGGCTGCGT 


GGTGAACCTG 


2700 


CGCAAAACCG 


TGGTGAACTT 


TCCGGTGGAA 


GATGAAGCGC 


TGGGCGGCAC 


CGCGTTTGTG 


2760 


CAC-ATGCCGG 


CGCATGGGCT 


GTTTCCGTGG 


TGCGGCCTGC 


TGCTGGATAC 


CCGCACCCTG 


2820 


GAAGTGCAGA 


GCGATTATAG 


CAGCTATGCG 


CGCACCAGCA 


TTCGCGCGAG 


CCTGACCTTT 


2880 


AACCGCGGCT 


TTAAAGCGGG 


CCGCAACATG 


CGCCGCAAAC 


TGTTTGGCGT 


GCTGCGCCTG 


2 94 0 


AAATGCCA7A 


GCCTGTTTCT 


GGATCTGCAG 


GTGAACAGCC 


TGCAG'ACCGT 


GTGCACCAAC 


3000 


ATTTA.TAAAA 


TTCTGCTGCT 


GCAGGCGTAT 


CGCTTTCATG 


CGTGCGTGCT 


GCAGCTGCCG 


3060 


TTTCATCAGC 


AGGTGTGGAA 


AAACCCGACC 


TTTTTTCTGC 


GCGTGATTAG 


CGATACCGCG 


3120 


AGCCTGTGCT 


ATAGCATTCT 


GAAAGCGAAA 


AACGCGGGCA 


TGAGCCTGGG 


CGCGAAAGGC 


3180 ■ 


GCGGCGGGCC 


CGCTGCCGAG 


CGAAGCGGTG 


CAGTGGCTGT 


GCCATCAGGC 


GTTTCTGCTG 


3240 


AAACTGACCC 


GCCATCGCGT 


GACCTATGTG 


CCGCTGCTGG 


GCAGCCTGCG 


CACCGCGCAG 


3300 


ACCCAGCTGA 


GCCGCAAACT 


GCCGGGCACC 


ACCCTGACCG 


CGCTGGAAGC 


GGCGGCGAAC 


3360 


CCGGCGCTGC 


CGAGCGATTT 


TAAAACCATT 


CTGGAT 






..3396 
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(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..3396 

CD) OTHER INFORMATION: /note= "hTRT-encoding sequence employing 

alternative codon distributions for 
enteric bacteria (high expressing 
genes ) " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:639: 

ATGCCGCGTG CTCCGCGTTG CCGTGCTGTT CGTTCCCTGC TGCGTTCCCA CTACCGTGAA ' 60 

GTTCTGCCGC TGGCTACCTT CGTTCGTCGT CTGGGTCCGC AGGGTTGGCG TCTGGTTCAG 120 

CGTGGTGACC CGGCTGCTTT CCGTGCTCTG GTTGCTCAGT GCCTGGTTTG CGTTCCGTGG 180 

GACGCTCGTC CGCCGCCGGC TGCTCCGTCC TTCCGTCAGG TTTCCTGCCT GAAAGAACTG 240 

GTTGCTCGTG TTCTGCAGCG TCTGTGCGAA CGTGGTGCTA AAAACGTTCT GGCTTTCGGT 300 

TTCGCTCTGC TGGACGGTGC TCGTGGTGGT CCGCCGGAAG CTTTCACCAC CTCCGTTCGT 3 60 

TCCTACCTGC CGAACACCGT TACCGACGCT CTGCGTGGTT CCGGTGCTTG GGGTCTGCTG '4 20 

CTGCGTCGTG TTGGTGACGA CGTTCTGGTT CACCTGCTGG CTCGTTGCGC TCTGTTCGTT 4 80 

CTGGTTGCTC CGTCCTGCGC TTACCAGGTT TGCGGTCCGC CGCTGTACCA GCTGGGTGCT 54 0 

GCTACCCAGG CTCGTCCGCC GCCGCACGCT TCCGGTCCGC GTCGTCGTCT GGGTTGCG^ 600 

CGTGCTTGGA ACCACTCCGT TCGTGAAGCT GGTGTTCCGC TGGGTCTGCC GGCTCCGGGT 6 60 

GCTCGTCGTC GTGGTGGTTC CGCTTCCCG-T TCCCTGCCGC TGCCGAAACG TCCGCGTCGT 7 20 

GGTGCTGCTC CGGAACCGGA ACGTACCCCG GTTGGTCAGG GTTCCTGGGC TCACCCGGGT 7 80 

CGTACCCGTG GTCCGTCCGA CCGTGGTTTC TGCGTTGTTT CCCCGGCTCG TCCGGCTGAA 8 4 0. 

GAAGCTACCT CCCTGGAAGG TGCTCTGTCC GGTACCCGTC ACTCCCACCC GTCCGTTGGT 900 

CGTCAGCACC ACGCTGGTCC GCCGTCCACC TCCCGTCCGC CGCGTCCGTG GGACACCCCG 9 60 

TGCCCGCCGG TTTACGCTGA AACCAAACAC TTCCTGTACT CCTCCGGTGA CAAAGAACAG 1020 

CTGCGTCCGT CCTTCCTGCT GTCCTCCCTG CGTCCGTCCC TGACCGGTGC TCGTCGTCTG 1080 

GTTGAAACCA TCTTCCTGGG TTCCCGTCCG TGGATGCCGG GTACCCCGCG TCGTCTGCCG 114 0 

CGTCTGCCGC AGCGTTACTG GCAGATGCGT CCGCTGTTCC TGGAACTGCT GGGTAACCAC 1200 
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GCTCAGTGCC 


CGTACGGTGT 


TCTGCTGAAA 


ACCCACTGCC 


CGCTGCGTGC 


TGCTGTTACC 


1260 


CCGGCTGCTG 


GTGTTTGCGC 


TCGTGAAAAA 


CCGCAGGGTT 


CCGTTGCTGC 


TCCGGAAGAA 


1320 


GAAGACACCG 


ACCCGCGTCG 


TCTGGTTCAG 


CTGCTGCGTC 


AGCACTCCTC 


CCCGTGGCAG 


1380' 


GTTTACGGTT 


TCGTTCGTGC 


TTGCCTGCGT 


CGTCTGGTTC 


CGCCGGGTCT 


GTGGGGTTCC 


1440 


CGTCACAACG 


AACGTCGTTT 


CCTGCGTAAC 


ACCAAAAAAT 


TCATCTCCCT 


GGGTAAACAC 


1500 


GCTAAACTGT. 


CCCTGCAGGA 


ACTGACCTGG 


AAAATGTCCG 


TTCGTGACTG 


CGCTTGGCTG 


1560 


CGTCGTTCCC 


CGGGTGTTGG 


TTGCGTTCCG 


GCTGCTGAAC 


ACCGTCTGCG 


TGAAGAAATC 


1620 


CTGGCTAAAT 


TCCTGCACTG 


GCTGATGTCC 


GTTTACGTTG 


TTGAACTGCT 


GCGTTCCTTC 


1680 


TTCTACGTTA 


CCGAAACCAC 


CTTCCAGAAA 


AACCGTCTGT 


TCTTCTACCG 


TAAATCCGTT 


1740 


TGGTCCAAAC 


TGCAGTCCAT 


CGGTATCCGT 


CAGCACCTGA 


AACGTGTTCA 


GCTGCGTGAA 


1800 


CTGTCCGAAG 


CTGAAGTTCG 


TCAGCACCGT 


GAAGCTCGTC 


CGGCTCTGCT. 


GACCTCCCGT 


1860 


CTGCGTTTCA 


"TCCCGAAACC 


GGACGGTCTG 


CGTCCGATCG 


TTAACATGGA 


CTACGTTGTT 


1920 


GGTGCTCGTA 


CCTTCCGTCG 


TGAAAAACGT 


GCTGAACGTC 


TGACCTCCCG 


TGTTAAAGCT 


1980 


CTGTTCTCCG 


TTCTGAACTA 


CGAACGTGCT 


CGT.CGTCCGG 


GTCTGCTGGG 


TGCTTCCGTT 


2040 


CTGGGTCTGG 


ACGACATCCA 


CCGTGCTTGG 


CGTACCTTCG 


TTCTGCGTGT 


TCGTGCTCAG 


2100 


GACCCGCCGC 


CGGAACTGTA 


CTTCGTTAAA 


GTTGACGTTA 


CCGGTGCTTA 


CGACACCATC 


2160 


CCGCAGGACC 


GTCTGACCGA 


AGTTATCGCT 


TCCATCATCA 


AACCGCAGAA 


CACCTACTGC. 


2220 


GTTCGTCGTT 


ACGCTGTTGT 


TCAGAAAGCT 


GCTCACGGTC 


ACG7TCGTAA 


AGCTTTCAAA 


2280 


TCCC ACGTTT 


CCACCCTGAC 


CGACCTGCAG 


CCGTACATGC 


GTCAGTTCGT 


-TGCTCACCTG 


2-3 4 0 


CAGGAAACCT 


CCCCGCTGCG 


TGACGCTGTT 


GTTATCGAAC 


AGTCCTCCTC 


CCTGAACGAA 


2400 


GCTTCCTCCG 


GTCTGTTCGA 


CGTTTTCCTG 


CGTTTCATGT 


GCCACCACGC 


TGTTCGTATC 


24 60 


CGTGGTAAAT 


CCTACGTTCA 


GTGCCAGGGT 


ATCCCGCAGG 


GTTCCATCCT 


GTCCACCCTG 


2520 


CTGTGCTCCC 


TGTG'CTACGG 


TGACATGGAA 


AACAAACTGT 


' TCGCTGGTAT 


CCGTCGTGAC 


2580 


GGTCTGCTGC 


TGCGTCTGGT 


TGACGACTTC 


CTGCTGGTTA 


CCCCGCACCT 


GACCCACGCT 


2640 


AAAACCTTCC 


TGCGTACCCT 


GGTTCGTGGT 


GTTCCGGAAT 


ACGGTTGCGT 


TGTTAACCTG 


2700 


CGTAAAACCG 


TTGTTAACTT 


CCCGGTTGAA 


GACGAAGCTC 


TGGGTGGTAC 


CGCTTTCGTT 


2760 


CAGATGCCGG 


CTCACGGTCT 


GTTCCCGTGG 


TGCGGTCTGC 


TGCTGGACAC 


CCGTACCCTG 


2820 


GAAG l 1 CAG i 


LtL-rL 1 AL, 1 


C ILL. 1 r\Lov^ i 


CGTACCTCCA 


TCCGTGCTTC 


rrTGACCTTC 


2880 


AACCGTGGTT 


TCAAAGCTGG 


TCGTAACATG 


CGTCGTAAAC 


TGTTCGGTGT 


TCTGCGTCTG 


2940 


AAATGCCACT 


CCCTGTTCCT 


GGACCTGCAG 


GTTAACTCCC 


TGCAGACCGT 


TTGCACCAAC 


3000 


ATCTACAAAA 


TCCTGCTGCT 


GCAGGCTTAC 


CGTTTCCACG 


CTTGCGTTCT 


GCAGCTGCCG 


3060 
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TTCCACCAGC AGGTTTGGAA AAACCCGACC TTCTTCCTGC GTGTTATCTC CGACACCGCT 3120 

TCCCTGTGCT ACTCCATCCT GAAAGCTAAA AACGCTGGTA TGTCCCTGGG TGCTAAAGGT 3180 

GCTGCTGGTC CGCTGCCGTC CGAAGCTGTT CAGTGGCTGT GCCACCAGGC TTTCCTGCTG 324 0 

AAACTGACCC GTCACCGTGT TACCTACGTT CCGCTGCTGG GTTCCCTGCG TACCGCTCAG 3 300 

ACCCAGCTGT CCCGTAAACT GCCGGGTACC ACCCTGACCG CTCTGGAAGC" TGCTGCTAAC 3360 

CCGGCTCTGC CGTCCGACTT CAAAACCATC CTGGAC 3396 



(2) INFORMATION FOR SEQ ID NO: 640: 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 3396 

(D) OTHER INFORMATION: /note= " hTRT-encoding sequence employing 

alternative codon distributions for 
yeast (all genes)" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 0: 

ATGCCAAGAG CTCCAAGATG TAGAGCTGTT AGATCTTTGT TGAGATCTCA TTATAGAGAA 60 

GTT7TGCCAT TGGCTACTTT " TGTTAGAAGA TTGGGTCCAC AAGGTTGGAG ATTGGTTCAA 120 

AGAGGTGATC CAGCTGCTTT TAGAGCTTTG GTTGCTCAAT GTTTGGTTTG TGTTCCATGG 180 

GATGCTAGAC CACCACCAGC TGCTCCATCT TTTAGACAAG TTTCTTGTTT GAAAGAAT7G 240 

GTTGCTAGAG TTTTGCAAAG ATTGTGTGAA AGAGGTGCTA AAAATGTTTT GGCTTTTGGT 300 

TTTGCTTTGT TGGATGGTGC TAGAGGTGGT CCACCAGAAG CTTTTACTAC TTCTGTTAGA 3 60 

TCTIATTTGC CAAATACTGT TACTGATGCT TTGAGAGGTT CTGGTGCTTG GGGTTTGTTG 4 20 

TTGAGAAGAG TTGGTGATGA TGTTTTGGTT CATTTGTTGG CTAGATGTGC TTTGTTTGTT 480 

TTGGTTGCTC CATCTTGTGC TTATCAAGTT TGTGGTCCAC CATTGTATCA ATTGGGTGCT 540 

GCTACTCAAG CTAGACCACC ACCACATGCT TCTGGTCCAA GAAGAAGATT GGGTTGTGAA 60 0 

AGAGCTTGGA ATCATTCTGT TAGAGAAGCT GGTGTTCCAT TGGGTTTGCC AGCTCCAGGT 660 

GCTAGAAGAA GAGGTGGTTC TGCTTCTAGA TCTTTGCCAT TGCCAAAAAG ACCAAGAAGA 7 2 0 

GGTGCTGCTC CAGAACCAGA AAGAACTCCA GTTGGTCAAG GTTCTTGGGC TCATCCAGGT 780 

AGAACTAGAG GTCCATCTGA TAGAGGTTTT TGTGTTGTTT CTCCAGCTAG ACCAGCTGAA 84 0 
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GAAGCTACTT CTTTGGAAGG TGCTTTGTCT GGTACTAGAC ATTCTCATCC ATCTGTTGGT 900 

AG AC AAC AT C ATGCTGGTCC ACCATCTACT TCTAGACCAC CAAGACCATG GGATACTCCA 960 

TGTCCACCAG TTTATGCTGA AACTAAACAT TTTTTGTATT CTTCTGGTGA TAAAGAACAA 102 0 

TTGAGACCAT CTTTTTTGTT GTCTTCTTTG AGACCATCTT TGACTGGTGC TAGAAGATTG 108 0 

GTTGAAACTA TTTTTTTGGG TTCTAGACCA TGGATGCCAG GTACTCCAAG AAGATTGCCA 114 0 

AGATTGCCAC AAAGATATTG GCAAATGAGA CCATTGTTTT TGGAATTGTT GGGTAATCAT 1200 

GCTCAATGTC CATATGGTGT TTTGTTGAAA ACTCATTGTC CATTGAGAGC TGCTGTTACT 1260 

CCAGCTGCTG GTGTTTGTGC TAGAGAAAAA CCACAAGGTT CTGTTGCTGC TCCAGAAGAA 132 0 

GAAGATACTG ATCCAAGAAG ATTGGTTCAA TTGTTGAGAC AACATTCTTC TCCATGGCAA 138 0 

GTTTATGGTT TTGTTAGAGC TTGTTTGAGA AGATTGGTTC CACCAGGTTT GTGGGGTTCT 144 0 

AGACATAATG AAAGAAGATT TTTGAGAAAT ACTAAAAAAT TTATTTCTTT GGGTAAACAT 1500 

GCTAAATTGT CTTTGCAAGA ATTGACTTGG AAAATGTCTG TTAGAGATTG TGGTTGGTTG 1560 

AGAAGATCTC CAGGTGTTGG TTGTGTTCCA GCTGCTGAAC ATAGATTGAG AGAAGAAATT 162 0 

TTGGCTAAA.T TTTTGCATTG GTTGATGTCT GTTTATGTTG TTGAATTGTT GAGATCTTTT 1680 

TTTTATGTTA CTGAAACTAC TTTTCAAAAA AATAGATTGT TTTTTTATAG AAAATCTG7T 174 0 

TGGTCTAAAT TGCAATCTAT TGGTATTAGA CAACATTTGA AAAGAGTTCA ATTGAGAGAA 1800 

T7GTCTGAAG CTGAAGTTAG ACAACATAGA GAAGCTAGAC CAGCTTTGT7 GACTTCTAGA 1860 

77GAGATTTA T7CCAAAACC AGATGGTTTG . AGACCAATTG TTAATATGGA TTATGTTG77 1920 

GGTGCTAGAA CTTT7AGAAG A G AAAAAAG A GCTGA-AAGAT TGACTTCT.AG AGTT-AAAGC7 138 0 

77GTTTTC7G 7TT7GAATTA TGAAAGAGCT AGAAGACCAG GT77GTTGGG TGCTTCTG7T 204 0 

77GGGTT7GG ATGATATTCA 7AGAGCT7GG AGAACTTTTG TT77GAGAGT TAGAGCTCAA 210 0 

GA7CCACCAC CAGAATTGTA TTTTGTTAAA G7TGATGTTA CTGGTGC7TA TG AT ACT ATT 2160 

■CCACAA.GATA GATTGACTGA AGTTATTGCT TCTATTATTA AACCACAAAA TACTTATTGT 222 0 

GTTAGAAGAT ATGCTGTTGT TCAAAAAGCT GCTCATGGTC ATGTTAGAAA AGCTTTTAAA " 2 28 0 

7C7CATGTTT CTACTTTGAC TGATTTGCAA CCATATATGA GACAATTTGT TGCTCATTTG 2 34 0 

CAAGAAACTT CTCCATTGAG AGATGCTGTT G7TATTGAAC AATCTTCTTC TTTGAATGAA 2 4 00 

GCTTCTTCTG GTTTGTTTGA TGTTTTTTTG AGATTTATGT GTCATCATGC TGTTAGAATT 2 4 60 

AGAGGTAAA.T CTTATGTTCA ATGTCAAGGT ATTCCACAAG GTTCTATTTT GTCTACTTTG 2520 

77GTGTTCTT TGTGTTATGG TGATATGGAA AATAAATTGT TTGCTGGTAT TAGAAGAGAT 2580 

GGTTTGTTGT TGAGATTGGT TGATGATTTT TTGTTGGTTA CTCCACATTT GACTCATGCT 2 64 0 

AAAACTTTTT TGAGAACTTT. GGTTAGAGGT GTTCCAGAAT ATGGTTGTGT TGTTAATTTG 2 700 
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AGAAJ&AACTG 


TTGTTAATTT 


TCCAGTTGAA 


GATGAAGCTT 


TGGGTGGTAC 


TGCTTTTGTT 


2760 


CAAATGCCAG 


CTCATGGTTT 


GTTTCCATGG 


TGTGGTTTGT 


TGTTGGATAC 


TAGAACTTTG 


2820 


GAAGTTCAAT 


CTGATTATTC 


TTCTTATGCT 


AGAACTTCTA 


TTAGAGCTTC 


TTTGACTTTT 


2880 


AATAGAGGTT 


TTAAAGCTGG 


TAGAAATATG 


AGAAGAAAAT 


TGTTTGGTGT 


TTTGAGATTG 


2940 


AAATGTCATT 


CTTTGTTTTT 


GGATTTGCAA 


GTTAATTCTT 


TGCAAACTGT 


TTGTACTAAT 


3000 


ATTTATAAAA 


TTTTGTTGTT 


GCAAGCTTAT 


AGATTTCATG 


CTTGTGTTTT 


GCAATTGCCA 


3060 


TTTCATCAAC 


AAGTTTGGAA 


AAATCCAACT 


TTTTTTTTGA 


GAGTTATTTC 


TGATACTGCT 


3120 


TCTTTGTGTT 


ATTCTATTTT 


GAAAGCTAAA 


AATGCTGGTA. 


TGTCTTTGGG 


TGCTAAAGGT 


3180 


GCTGCTGGTC 


CATTGCCATC 


TGAAGCTGTT 


CAATGGTTGT 


GTCATCAAGC 


TTTTTTGTTG 


324 0 


AAATTGACTA 


GACATAGAGT 


TACTTATGTT 


CCATTGTTGG 


GTTCTTTGAG 


AACTGCTCAA 


3300 


ACTCAATTGT 


CTAGAAAATT 


GCCAGGTACT 


ACTTTGACTG 


CTTTGGAAGC 


TGCTGCTAAT- 


3360 


CCAGCTTTGC 


CATCTGATTT 


TAAAACTATT 


TTGGAT 






3396 



(2) INFORMATION FOR 'SEQ ID NO: 641: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3396 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1.-.3396 

(D) OTHER INFORMATION: /note= "hTRT-encoding sequence employing 

alternative codon distributions for 
yeast (high expressing genes)" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 1: 

ATGCCAAGAG CTCCAAGATG TAGAGCTGTT AGATCTTTGT TGAGATCTCA CTACAGAGAA 60 

GTTTTGCCAT TGGCTACTTT CGTTAGAAGA TTGGGTCCAC AAGGTTGGAG ATTGGTTCAA 120 

AGAGGTGACC CAGCTGCTTT CAGAGCTTTG GTTGCTCAAT GTTTGGTTTG TGTTCCATGG 180 

GACGCTAGAC CACCACCAGC TGCTCCATCT TTCAGACAAG TTTCTTGTTT GA-.GGAATTG 240 

GTTGCTAGAG TTTTGCAAAG ATTGTGTGAA AGAGGTGCTA AGAACGTTTT GGCTTTCGGT 300 

TTCGCTTTGT TGGACGGTGC TAGAGGTGGT CCACCAGAAG CTTTCACTAC TTCTGTTAGA 360 

TCTTACTTGC CAAACACTGT TACTGACGCT TTGAGAGGTT CTGGTGCTTG GGGTTTGTTG 4 20 

TTGAGAAGAG TTGGTGACGA CGTTTTGGTT CACTTGTTGG CTAGATGTGC TTTGTTCGTT 4 80 
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T7GGTTGCTC CATCTTGTGC TTACCAAGTT TGTGGTCCAC CATTGTACCA . ATTGGGTGCT 540 

GCTACTCAAG CTAGACCACC ACCACACGCT TCTGGTCCAA GAAGAAGATT GGGTTGTGAA 600 

AGAGCTTGGA ACCACTCTGT TAGAGAAGCT GGTGTTCCAT TGGGTTTGCC AGCTCCAGGT 6 60 

GCTAGAAGAA GAGGTGGTTC TGCTTCTAGA TCTTTGCCAT TGCCAAAGAG ACCAAGAAGA 7 20 

GGTGCTGCTC CAGAACCAGA AAGAACTCCA GTTGGTCAAG GTTCTTGGGC TCACCCAGGT 7 80 

AGAACTAGAG GTCCATCTGA CAGAGGTTTC TGTGTTGTTT CTCCAGCTAG ACCAGCTGAA 8 40 

GAAGCTACTT CTTTGGAAGG TGCTTTGTCT GGTACTAGAC ACTCTCACCC ATCTGTTGGT 900 

AG AC AAC AC C ACGCTGGTCC ACCATCTACT TCTAGACCAC CAAGACCATG GGACACTCCA 9 60 

TGTCCACCAG TTTACGCTGA AACTAAGCAC TTCTTGTACT CTTCTGGTGA CAAGGAACAA 1020 

TTGAGACCAT CTTTCTTGTT GTCTTCTTTG AGACCATCTT TGACTGGTGC TAGAAGATTG. 1080 

GTTGAAACTA TTTTCTTGGG TTCTAGACCA TGGATGCCAG GTACTCCAAG AAGATTGCCA 114 0 

AGATTGCCAC AAAGATACTG GCAAATGAGA CCATTGTTCT TGGAATTGTT GGGTAACCAC 1200 

GCTCAATGTC CATACGGTGT TTTGTTGAAG ACTCACTGTC CATTG AGAGC TGCTGTTACT. 12 60 

CCAGCTGCTG GTGTTTGTGC TAGAGAAAAG CCACAAGGTT CTGTTGCTGC TCCAGAAGAA .1320 

GAAGACACTG ACCCAAGAAG ATTGGTTCAA TTGTTGAGAC AACACTCTTC TCCATGGCAA 138 0 

G77TACGGTT TCGTTAGAGC TTGTTTGAGA AGATTGGTTC CACCAGGTTT GTGGGGTTCT 14 40 

AGACACAACG A-AAGAAGATT CTTGAGAAAC ACTAAGAAGT TCATTTCTTT GGGTAAGCAC 1500 

GCTAAGTTGT CTTTGCAAGA ATTGACTTGG AAGATGTCTG TTAGAGACTG TGCTTGGTTG 15 60 

AGA-AGATCTC CAGGTGTTGG TTGTGTTCCA GCTGCTGAAC ACAGATTGAG AGAAGAAATT 1-62 0 

77GGCTAAGT TCTTGCACTG GTTGATGTCT GTTTACGTTG TTGAATTGTT GAGATCTTTC 1680 

77CTACGTTA CTGAAACTAC TTTCCAAAAG AACAGATTGT TCTTCTACAG A.AAGTCTGTT 17 4 0 

7GGTCTAAGT TGCAATCTAT TGGTATTAGA CAACACTTGA AGAGAGTTCA ATTGAGAGAA 18 00 

TTGTCTGAAG. CTGAAGTTAG ACAACACAGA GAAGCTAGAC CAGCTTTGTT GACTTCTAGA 18 60 

TTGAGATTCA TTCCAAAGCC AGACGGTTTG AGACCAATTG TTAACATGGA CTACGTTGTT 1920 

GGTGCTAGAA CTTTCAGAAG AGAAAAGAGA GCTGAAAGAT TGACTTCTAG AGTTAAGGCT 1980 

TTGTTCTCTG TTTTGAACTA CGAAAGAGCT AGAAGACCAG GTTTGTTGGG TGCTTCTGTT 20 4 0 

77GGGTTTGG ACGACATTCA CAGAGCTTGG AG.AACTTTCG TTTTGAGAGT TAGAGCTCAJV 2100 

GACCCACCAC CAGAATTGTA CTTCGTTAAG GTTGACGTTA CTGGTGCTTA CGACACTATT 2160 

CCACAAGACA GATTGACTGA AGTTATTGCT TCTATTATTA AGCCACAAAA CACTTACTGT 2220 

GTTAGAAGAT ACGCTGTTGT TCAAAAGGCT GCTCACGGTC ACGTTAGAAA GGCTTTCAAG 22 80 

TCTCACGTTT CTACTTTGAC TGACTTGCAA CCATACATGA GACAATTCGT TGCTCACTTG 234 0 
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CAAGAAACTT CTCCATTGAG AGACGCTGTT GTTATTGAAC AATCTTCTTC TTTGAACGAA 2 4 00 

GCTTCTTCTG GTTTGTTCGA CGTTTTCTTG AGATTCATGT GTCACCACGC TGTTAGAATT 2 4 60 

AGAGGTAAGT CTTACGTTCA ATGTCAAGGT ATTCCACAAG GTTCTATTTT GTCTACTTTG 2 52 0 

TTGTGTTCTT TGTGTTACGG TGACATGGAA AACAAGTTGT TCGCTGGTAT TAGAAGAGAC 2 58 0 

GGTTTGTTGT TGAGATTGGT TGACGACTTC TTGTTGGTTA CTCCACACTT GACTCACGCT 2 64 0 

AAGACTTTCT TGAGAACTTT GGTTAGAGGT GTTCCAGAAT ACGGTTGTGT TGTTAACTTG 2700 

AGAAAGACTG TTGTTAACTT CCCAGTTGAA GACGAAGCTT TGGGTGGTAC TGCTTTCGTT 27 60 

CAAATGCCAG CTCACGGTTT GTTCCCATGG TGTGGTTTGT TGTTGGACAC TAGAACTTTG 2 820 

GAAGTTCAAT CTGACTACTC TTCTTACGCT AGAACTTCTA TTAGAGCTTC TTTGACTTTC 2 88 0 

AA.CAGAGGTT TCAAGGCTGG TAGAAACATG AGAAGAAAGT TGTTCGGTGT TTTGAGATTG 2 94 0 

AAGTGTCACT CTTTGTTCTT GGACTTGCAA GTT-AACTCTT TGCAAACTGT TTGTACTAAC 3000 

ATTTACAAGA' TTTTGTTGTT GCAAGCTTAC AGATTCCACG CTTGTGTTTT GCAATTGCCA 3060 

TTCCACCAAC AAGTTTGGAA GAACCCAACT TTCTTCTTGA ■ GAGTTATTTC TGACACTGCT 3120 

TCTTTGTGTT ACTCTATTTT GAAGGCTAAG AACGCTGGTA TGTCTTTGGG TGCTAAGGGT 318 0 

GCTGCTGGTC CATTGCCATC TGAAGCTGTT CAATGGTTGT GTCACCAAGC TTTCTTGTTG 324 0 

AAGTTGACTA GACACAGAGT TACTTACGTT CCATTGTTGG GTTCTTTGAG AACTGCTCAA 3 300 

AC7CAATTGT CTAGAAAGTT GCCAGGTACT ACTTTGACTG CT7TGGAAGC TGCTGCTAAC 3 360 

CCAGCTTTGC CATCTGACTT CAAGACT'ATT TTGGAC - 3 39 6 

;2; information for seq id no:6-;2; 
(i) 'sequence characteristics: 

(A) LENGTH: 3396 base pairs 

(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: -linear 

(ii) MOLECULE TYPE: DMA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . .3396 

(D) OTHER INFORMATION: /note- ""generic" hTRT protein encoding 

sequence " 

SEQUENCE DESCRIPTION: SEQ ID NO: 642: 
ATGCCACGTG CCCCACGTTG TCGTGCCGTT CGTTCTTTGT TGCGTTCTCA CTACCGTGAA 60 
GTTTTGCCAT TGGCCACCTT CGTTCGTCGT TTGGGTCCAC AAGGTTGGCG TTTGGTTCAA 12 0 

CGTGGTGATC CAGCCGCCTT CCGTGCCTTG GTTGCCCAAT GTTTGGTTTG TGTTCCATGG 180 
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GATGCCCGTC CACCACCAGC CGCCCCATCT TTCCGTCAAG TTTCTTGTTT GAAAGAATTG 24 0 

GTTGCCCGTG TTTTGCAACG TTTGTGTGAA CGTGGTGCCA AAAACGTTTT GGCCTTCGGT 300 

TTCGCCTTGT TGGATGGTGC CCGTGGTGGT CCACCAGAAG CCTTCAGCAC CTCTGTTCGT 3 60 

TCTTACTTGC CAAACACCGT TACCGATGCC TTGCGTGGTT CTGGTGCCTG GGGTTTGTTG 4 20 

TTGCGTCGTG TTGGTGATGA TGTTTTGGTT CAC.TTGTTGG CCCGTTGTGC CTTGTTCGTT 4 80 

TTGGTTGCCC CATCTTGTGC CTACCAAGTT TGTGGTCCAC CATTGTACCA ATTGGGTGCC 54 0 

GCCACCCAAG' CCCGTCCACC ACCACACGCC TCTGGTCCAC GTCGTCGTTT GGGTTGTGAA 600 

CGTGCCTGGA ACCACTCTGT TCGTGAAGCC GGTGTTCCAT TGGGTTTGCC AGCCCCAGGT 660 

GCCCGTCGTC GTGGTGGTTC TGCCTCTCGT TCTTTGCCAT TGCCAAAACG TCCACGTCGT 7 20 

GGTGCCGCCC CAGAACCAGA ACGTACCCCA GTTGGTCAAG GTTCTTGGGC CCACCCAGGT 7 80 

CGTACCCGTG GTCCATCTGA TCGTGGTTTC TGTGTTGTTT CTCCAGCCCG TCCAGCCGAA 840 

GAAGCCACCT CTTTGGAAGG TGCCTTGTCT GGTACCCGTC ACTCTCACCC ATCTGTTGGT 900 

CGTCAACACC ACGCCGGTCC ACCATCTACC " TCTCGTCCAC CACGTCCATG GGATACCCCA 9 60 

TGTCCACCAG TTTACGCCGA" AACCAAACAC TTCTTGTACT CTTCTGGTGA TAAAGAACAA 1020 

TTGCGTCCAT CTTTCTTGTT GTCTTCTTTG CGTCCATCTT TGACCGGTGC CCGTCGTTTG 1080 

GTTGAAACCA TTTTCTTGGG TTCTCGTCCA TGGATGCCAG GTACCCCACG TCGTTTGCCA 1140 

CGTTTGCCAC AACGTTACTG GCAAAT'GCGT CCATTGTTCT TGGAATTGTT GGGTAACCAC 12 00 

GCCCAATGTC CATACGGTGT TTTGTTGAAA ACCCACTGTC . CATTGCGTGC CGCCGTTACC 12 60 

CCAGCCGCCG GTGTTTGTGC CCGTGAAAAA CCACAAGGTT CTGTTGCCGG CCCAGAAGAA 1320 

GAAGATACCG ATCCACGTCG TTTGGTTCAA TTGTTGCGTC AACACTCTTC TCCATGGCAA 13 80 

GTTTACGGTT TCGTTCGTGC CTGTTTGCGT CGTTTGGTTC CACCAGGTTT GTGGGGTTCT 1440 

CGTCACAACG AACGTCGTTT CTTGCGTAAC ACCAAAAAAT TCATTTCTTT GGGTAAACAC 15 00 

GCCAAATTGT CTTTGCAAGA ATTGACCTGG AAAATGTCTG TTCGTGATTG TGCCTGGTTG 15 60 

CGTCGTTCTC CAGGTGTTGG TTGTGTTCCA GCCGCCGAAC ACCGTTTGCG TGAAGAAATT 1620 

TTGGCCAAAT TCTTGCACTG GTTGATGTCT GTTTACGTTG TTGAATTGTT GCGTTCTTTC 1680 

TTCTACGTTA CCGAAACCAC CTTCCAAAAA AACCGTTTGT TCTTCTACCG TAAATCTGTT 17 4 0 

TGGTCTAAAT TGCAATCTAT TGGTATTCGT CAACACTTGA AJ^CGTGTTCA ATTGCGTGAA 18 00 

TTGTCTGAAG CCGAAGTTCG TCAACACCGT GAJVGCCCGTC CAGCCTTGTT GACCTCTCGT 18 60 

TTGCGTTTCA TTCCAAAACC AGAT.GGTTTG CGTCCA.z\TTG TTAACATGGA TTACGTTGTT 1920 

GGTGCCCGTA CCTTCCGTCG TGAAAAACGT GCCGAACGTT TGACCTCTCG TGTTAAAGCC 1980 

TTGTTCTCTG TTTTGAACTA CGAACGTGCC CGTCGTCCAG GTTTGTTGGG TGCCTCTGTT 204 0 
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TTGGGTTTGG ATGATATTCA CCGTGCCTGG CGTACCTTCG TTTTGCGTGT TCGTGCCCAA 2100 

GATCCACCAC CAGAATTGTA CTTCGTTAAA GTTGATGTTA CCGGTGCCTA CGATACCATT 2160 

CCACAAGATC GTTTGACCGA AGTTATTGCC TCTATTATTA AACCACAAAA CACCTACTGT 2220 

GTTCGTCGTT ACGCCGTTGT TCAAAAAGCC GCCCACGGTC ACGTTCGTAA AGCCTTCAAA 2280 

TCTCACGTTT CTACCTTGAC CGATTTGCAA CCATACATGC GTCAATTCGT TGCCCACTTG 2 34 0 

CAAGAAACCT CTCCATTGCG TGATGCCGTT GTTATTGAAC AATCTTCTTC TTTGAACGAA 2 4 00 

GCCTCTTCTG GTTTGTTCGA TGTTTTCTTG CGTTTCATGT GTCACCACGC CGTTCGTATT 2 4 60 

CGTGGTAAAT CTTACGTTCA ATGTCAAGGT ATTCCACAAG GTTCTATTTT GTCTACCTTG 2520 

TTGTGTTCTT TGTGTTACGG TGATATGGAA AACAAATTGT TCGCCGGTAT TCGTCGTGAT 2580 

GGTTTGTTGT TGCGTTTGGT TGATGATTTC TTGTTGGTTA CCCCACACTT GACCCACGCC 2 64 0 

AAAACCTTCT TGCGTACCTT- GGTTCGTGGT GTTCCAGAAT ACGGTTGTGT TGTTAACTTG 2700 

CGTAAAACCG TTGTTAACTT CCCAGTTGAA GATGAAGCCT TGGGTGGTAC CGCCTTCGTT 27 60 

CAAATGCCAG CCCACGGTTT GTTCCCATGG TGTGGTTTGT TGTTGGATAC CCGTACCTTG 2820 

GAAGTTCAAT CTGATTACTC TTCTTACGCC CGTACCTCTA TTCGTGCCTC TTTGACCTTC 2880 

AACCGTGGTT TCAAAGCCGG TCGTAACATG CGTCGTAAAT TGTTCGGTGT TTTGCGTTTG 2 940 

AAATGTCACT CTTTGTTCTT GGATTTGCAA GTTAACTCTT TGCAAACCGT TTGTACCAAC 3000 

A T T T A C A J\AA TTTTGTTGTT GCAAGCCTAC CGTTTCCACG CCTGTGTTTT GCAATTGCCA 30 60 

TTCCACCAAC AAGTTTGGAA AAACCCAACC TTCTTCTTGC GTGTTATTTC TGATACCGCC 3120 

TCTTTGTGTT ACTCTATTTT GAAAGCCAAA AACGCCGGTA TGTCTTTGGG TGCCAAAGGT ^3180 

GCCGCCGGTC -CATTGCCA.TC TGAAGCCGTT CAATGGTTGT GTCACCAAGC CTTCTTGTTG 32 4 0 

AAATTGACCC GTCACCGTGT TACCTACGTT CCATTGTTGG GTTCTTTGCG TACCGCCCAA 3300 

ACCCAATTG7 CTCGTAAATT GCCAGGTACC ACCTTGACCG CCTTGGAAGC CGCCGCCAAC 3 3 60 

CCAGCCTTGC CATCTGATTT CAAAACCATT TTGGAT 3 3 96 

(2) INFORMATION FOR SEQ ID NO: 64 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 100 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 100 

(D) OTHER INFORMATION: /note- "oligonucleotide IB" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 
CCAGCGGCAG AACTTCGCGA TAGTGGGAAC GCAGCAGGGA ACGAACAGCA CGGCAACGCG 60 
GAGCACGCGG CATATGGTCG ACTCTAGAGC TCCCGCGTGC 100 



(2) INFORMATION FOR SEQ ID NO: 64 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide IT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 
GCACGCGGGA GCTCTAGAGT CGACCATATG CCGCGTGCTC CGCGTTGCCG TGCTGTTCGT 60 
TCCCTGCTGC GTTCCCACTA TCGCGAAGTT _ 90 

(2) INFORMATION FOR SEQ ID NO:645: - 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 2B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645: 
GGCACTGAGC AACCAGAGCA CGGAAAGCAG CCGGGTCACC ACGCTGAACC AGACGCCAAC 60 
CCTGCGGGCC CAGACGACGA ACGAAGGTAG 90 
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(2) INFORMATION FOR SEQ ID NO: 646: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

( 0 ) OTHER INFORMATION: /note- "oligonucleotide 2T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646: 



CTGCCGCTGG CTACCTTCGT TCGTCGTCTG GGCCCGCAGG GTTGGCGTCT GGTTCAGCGT 
GGTGACCCGG CTGCTTTCCG TGCTCTGGTT 



(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
'(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 33" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 
GAACACGAGC AACCAGTTCT TTCAGGCAGG AAACCTGACG GAAGGACGGA GCAGCCGGCG 
GCGGACGAGC GTCGCACGGA ACGCAAACCA 



(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DNA 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION : /note- "oligonucleotide 3T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:648: 
GCTCAGTGCC TGGTTTGCGT TCCGTGGGAC GCTCGTCCGC CGCCGGCTGC TCCGTCCTTC 
CGTCAGGTTT CCTGCCTGAA AGAACTGGTT 



(2} INFORMATION FOR SEQ ID NO: 64 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



( ix) FEATURE: 

(A) NAME/KEY: - 

(3)' LOCATION: 1 . ,'90 

(D) OTHER INFORMATION: /note* "oligonucleotide 4B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649: 
ATGCTTCCGG' CGGACCACCA CGAGCACCGT CCAGCAGAGC GAAACCGAAA GCCAGAACGT 
TTTTAGCACC ACGTTCGCAC AGACGCTGCA 



INFORMATION FOR SEQ ID MO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

( ii) ~ MOLECULE TYPE: DNA 



Cix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 4T n 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:650: 
GCTCGTGTTC TGCAGCGTCT GTGCGAACGT GGTGCTAAAA ACGTTCTGGC TTTCGGTTTC 
GCTCTGCTGG ACGGTGCTCG TGGTGGTCCG 
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(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

<D) OTHER INFORMATION : /note= "oligonucleotide 5B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651: 
CAACACGACG CAGCAGCAGA CCCCAAGCAC CGGAACCACG CAGAGCGTCG GTAACGGTGT 
TCGGCAGGTA GGAACGAACG GAGGTGGTGA 



(2) INFORMATION FOR SEQ ID NO: 652: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
"(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 90 

-(D) OTHER INFORMATION: /note= "oligonucleotide 5T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 652: 
CCGGAAGCAT TCACCACCTC CGTTCGTTCC TACCTGCCGA ACACCGTTAC CGACGCTC7G 
CGTGGTTCCG GTGCTTGGGG TCTGCTGCTG 



(2) INFORMATION FOR SEQ ID NO: 653: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION : 1 . . 90 

( D) OTHER INFORMATION : /note= "oligonucleotide 6B M 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653: 
GCGGCGGACC ACAAACCTGG TAAGCGCAGG ACGGAGCAAC CAGAACGAAC AGAGCGCAAC 60 
GAGCCAGCAG GTGAACCAGA ACGTCGTCAC 90 

(2) INFORMATION FOR SEQ ID NO: 654: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA ■ 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 6T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654: 
CGTCGTGTTG GTGACGACGT TCTGGTTCAC CTGCTGGCTC GTTGCGCTCT GTTCGTTCTG 60 
GTTGCTCCGT CCTGCGCTTA CCAGGTTTGT 90 



(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

[ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 7B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 
GGTTCCAAGC ACGTTCGCAA CCCAGACGAC GACGCGGACC GGAAGCGTGC GGCGGCGGAC 
GAGCCTGGGT AGCAGCACCC AGCTGGTACA 



60 
90 
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(2) INFORMATION FOR SEQ ID NO: 65 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 , . 90 

(D) OTHER INFORMATION : /note** "oligonucleotide 7 T " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656: 
GGTCCGCCGC TGTACCAGCT GGGTGCTGCT ACCCAGGCTC GTCCGCCGCC GCACGCTTCC 
GGTCCGCGTC GTCGTCTGGG TTGCGAACGT 



(2) INFORMATION FOR SEQ ID NO: 657: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION : I . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 3 B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657: 
GCAGCGGCAG GGAACGGGAA GCGGAACCAC CACGACGACG AGCACCCGGA GCCGGCAGAC 
CCAGCGGAAC ACCAGCTTCA CGAACGGAGT 



(2) INFORMATION FOR SEQ ID NO: 658: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 
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(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 8T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658: 
GCTTGGAACC ACTCCGTTCG TGAAGCTGGT GTTCCGCTGG GTCTGCCGGC TCCGGGTGCT 
CGTCGTCGTG GTGGTTCCGC TTCCCGTTCC 



(2) INFORMATION FOR SEQ ID NO: 65 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 
"(B) LOCATION: 1 . . 90 
(D) OTHER INFORMATION: /note= "oligonucleotide 9 B M 



(xi) SEQUENCE: DESCRIPTION: SEQ ID NO: 659: 
GACCACGGGT ACGACCCGGG TGAGCCCAGG AACCCTGACC AACCGGGGTA CGTTCCGGTT 
CCGGAGCAGC ACCACGACGC GGACGTTTCG 



',2) INFORMATION FOR SEQ I'D NO: 660: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 
■ .(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 9T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 
CTGCCGCTGC CGAAACGTCC GCGTCGTGGT GCTGCTCCGG AACCGGAACG TACCCCGGTT 
GGTCAGGGTT CCTGGGCTCA CCCGGGTCGT 
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(2) INFORMATION FOR SEQ ID NO: 661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 



(ix) FEATURE: 

(A) NAME/KEY: - 

<B) LOCATION: 1 . . 90 . 

(D) OTHER INFORMATION: /note- "oligonucleotide 10B" 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 661: 
AGTGACGGGT GCCGGACAGA GCACCTTCCA GGGAGGTAGC TTCTTCAGCC GGACGAGCCG 
GGGAAACAAC GCAGAAACCA CGGTCGGACG 



{2} INFORMATION FOR SEQ ID NO: 662.: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: 



(ix) FEATURE: 

( A) NAME /KEY : - 

(3) LOCATION : 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonuclect ice 10T" 



(;<i) SEQUENCE DESCRIPTION: SEQ ID NO: 662: 
ACCCGTGGTC CGTCCGACCG TGGTTTCTGC GTTGTTTCCC CGGCTCGTCC GGCTGAAGAA 
GCTACCTCCC TGGAAGGTGC TCTGTCCGGC 



(2) INFORMATION FOR SEQ ID NO: 663: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : ' 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 11B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663: 
AAACC GGCGG GCACGGGGTG TCCCACGGAC GCGGCGGACG GGAGGTGGAC GGCGGACCAG 
CGTGGTGCTG ACGACCAACG GACGGGTGGG 



(2) INFORMATION FOR SEQ ID NO: 664: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: 



(ix) FEATURE: 

(A) NAME /KEY : - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 11T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664: 
-.CCCGTCAC? CCCACCCGTC CGTTGGTCGT CAGCACCACG CTGGTCCGCC GTCCACCTCC' 
CGTCCGCCGC GTCCGTGGGA CACCCCGTGC 



:2} INFORMATION FOR SEQ ID NO: 665: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 12B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 65: 
CAGGGACGG ACGCAGGGAG GACAGCAGGA AGGACGGACG CAGCTGTTCT TTGTCACCGG 
.GGAGTACAG GAAGTGTTTG GTTTCAGCGT 
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(2) INFORMATION FOR SEQ ID KO:666: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



DNA 



(ix) FEATURE: 

CA) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 12T" 
fxi) SEQUENCE DESCRIPTION: SEQ ID NO: 666-: 
CCGCCGGTTT ACGCTGAAAC CAAACACTTC CTGTACTCCT CCGGTGACAA AGAACAGCTG 60 
CQTCCGTCCT TCCTGCTGTC CTCCCTGCGT 

90 



(2} INFORMATION FOR SEQ ID NO: 667; 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 
(-C) STRANDEDNESS: sinale 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



ix) FEATURE: 

(A) NAME /KEY: - 

I 3) LOCATION: 1 . . 90 

{ D) OTHER INFORMATION: /note- "oligonucleotide 13B' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667: 

60 



GCTGCGGCAG ACGCGGCAGA CGACGCGGGG TGCCCGGCAT CCACGGACGG GAACCCAGGA 



AGATAGTTTC AACCAGACGA CGAGCACCGG 

90 



(2) INFORMATION FOR SEQ ID NO: 668: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 90 

(D) OTHER INFORMATION : /note- "oligonucleotide 13T" 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 : 
CCGTCCCTGA CCGGTGCTCG TCGTCTGGTT GAAACTATCT TCCTGGGTTC CCGTCCGTGG 60 
ATGCCGGGCA-CCCCGCGTCG TCTGCCGCGT 

90 

(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME / KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 14B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669: 
GCGGGCAGTG GGTTTTCAGC AGAACACCAT ACGGGCACTG AGCGTGGTTG CCCAGCAGTT 60 
CCAGGAACAG CGGACGCATC TGCCAGTAAC gQ 

(2) INFORMATION FOR SEQ IDNO:670: 

(i) SEQUENCE CHARACTERISTICS: - 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) ' TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1. . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 14T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670: 
CTGCCGCAGC GTTACTGGCA GATGCGTCCG CTGTTCCTGG AACTGCTGGG CAACCACGCT . 60 

CAGTGCCCGT ATGGTGTTCT GCTGAAAACC 9Q 
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(2) INFORMATION FOR SEQ ID NO: 671: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(2) INFORMATION FOR SEQ ID NO; 672: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
90 



(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..90 

(D) OTHER INFORMATION: /note- "oligonucleotide 15B" 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 6.71: 
GGTCGGTATC TTCTTCTTCC GGAGCAGCAA CGGAACCCTG CGGTTTTTCA CGAGCGCAAA 
CACCAGCAGC CGGGGTAACA GCAGCACGCA 



(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A)' NAME /KEY: - 

(3) LOCATION: 1 . . 90 

CD) OTHER INFORMATION: /note- "oligonucleotide 15T" 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 672: 
CACTGCCCGC TGCGTGCTGC TGTTACCCCG GCTGCTGGTG TTTGCGCTCG TGAAAAACCG 
CAGGGTTCCG TTGCTGCTCC GGAAGAAGAA 

(2) INFORMATION FOR SEQ ID NO : 673: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(8) TYPE: nucleic acid 
(C) STRANDEDNESS: single - 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1. . 90 

"(D) OTHER INFORMATION : /note- "oligonucleotide 16B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673: 
GCGGAACCAG ACGACGCAGG CATGCACGAA CGAAACCGTA AACCTGCCAC GGGGAGGAGT 
GCTGACGCAG CAGCTGAACC AGACGACGCG 



(2) INFORMATION FOR SEQ ID NO: 67 4: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DMA 

(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 16T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674: 
GATACCGACC.CGCGTCGTCT GGTTCAGCTG CTGCGTCAGC ACTCCTCCCC GTGGCAGGTT 
TACGGTTTCG TTCGTGCATG CCTGCGTCGT 



(2) INFORMATION FOR SEQ ID NO: 675: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base oairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: -■ 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 17B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675: 
GGG AC AGTTT AGCGTGTTTA CCCAGGGAGA TGAATTTTTT GGTGTTACGC AGGAAACGAC 
GTTCGTTGTG ACGGGAACCC CACAGACCCG 
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(2) INFORMATION FOR SEQ ID NO: 676: 

(i) SEQUENCE CHARACTERISTICS: 
(Aj LENGTH: 90 base pairs 
(B) TYPE; nucleic acid 
CC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 17T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO;676: 
CTGGTTCCGC CGGGTCTGTG GGGTTCCCGT CACAACGAAC GTCGTTTCCT GCGTAACACC 60 
AAAAAATTCA TCTCCCTGGG TAAACACGCT 



90 



(2) INFORMATION FOR SEQ ID NO:677: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(i>:) FEATURE: - - 

CA) NAME/ KEY : - 
(3) LOCATION : 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 18B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 67 7: 
GGTGTTCAGC AGCCGGAACG CAACCAACAC CCGGAGAACG ACGCAGCCAA GCGCAGTCAC 60 
GAACGGACAT TTTCCAGGTC AGTTCCTGCA - 9Q 

(2) INFORMATION FOR SEQ ID NO: 678: 

U) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic 'acid 
CC) STRANDEDNESS : single 
(D) TOPOLOGY: linear- 

(ii) MOLECULE TYPE: DNA ■ " ' 
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(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION : /note- "oligonucleotide 1ST" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678: 
AAACTGTCCC TGCAGGAACT GACCTGGAAA ATGTCCGTTC GTGACTGCGC TTGGCTGCGT 60 
CGTTCTCCGG GTGTTGGTTG CGTTCCGGCT 9Q 

(2) INFORMATION FOR SEQ ID NO: 679: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid ■ 
(CJ STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 193" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 67 9: 
CGGTAACGTA GAAGAAGGAA CGCAGCAGTT CAACAACGTA TACGGACATC AGCCAGTGCA 60 
GGAATTTAGC CAGGATTTCT TCACGCAGAC 

(2) INFORMATION FOR SEQ ID NO: 630: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
'(D) TOPOLOGY : linear 

- (ii) MOLECULE TYPE: DNA 

(ix) FEATURE : ■ 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: / no t e - ' " ol igonucl eo t ide 19T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680: 
GCTGAACACC GTCTGCGTGA AGAAATCCTG GCTAAATTCC TGCACTGGCT GATGTCCGTA . 60 
TACGTTGTTG AACTGCTGCG TTCCTTCTTC 90 



90 
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(2) INFORMATION FOR SEQ ID NO: 681: 

(i) SEQUENCE CHARACTERISTICS: 
(Aj LENGTH: 90 base pairs 
(B) TYPE: nucleic acid 
CC) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

fix) FEATURE : 

(AJ NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION : /note- "ol igonucieo t ide 20B" 

SEQUENCE DESCRIPTION: SEQ ID NO:681: 
GTTTCAGGTG CTGACGGATA CCGATGGACT GCAGTTTGGA CCAAACGGAT TTACGGTAGA 
AGAACAGACG GTTTTTCTGG AAGGTGGTTT 

(2) INFORMATION FOR SEQ ID NO: 682: 

(iJ SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid ( 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(i:-:) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- /'oligonucleotide 20T" 

Ui) SEQUENCE DESCRIPTION: SEQ ID "NO: 682: 
TACGTTACCG AAACCACCTT CCAGAAAAAC CGTCTGTTCT TCTACCGTAA ATCCGTTTGG " 
TCCAAACTGC. AGTCCATCGG TATCCGTCAG 

(2) INFORMATION FOR SEQ ID NO: 683: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 89 base pairs 

(B) TYPE: nucleic 'acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 
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(ix)' FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 89 

.(D) OTHER INFORMATION * /note- "ol i ™ 

iUl ' /noce " oligonucleotide 21B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:683: 
«T GMA CGC «»«»« TCAGCAGAGC GGGAGGAGCT TCACGGTGCT GACGAACTTC 60 

AGCTTCGGAC AGTTCACGCA GCTGAACAC 

89 

(2) INFORMATION FOR SEQ ID NO: 634: 

CD SEQUENCE CHARACTERISTICS* 
(A) LENGTH: 90 base pairs 
.(B) TYPE: nucleic acid ' 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1 . 90 

(D) OTHER INFORMATION : /note- "oligonucleotide 21T" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684: ' 
CACCTGAAAC GTGTTGAGCT GCGTGAAGTG TGCGAAGCTG AAGTTCGTCA GCAGGGTGAA 

GCTCGTCCGG CTCTGCTGAC CTCCCGTCTG 



60 

— 90 

(2) INFORMATION FOR SEQ ID NO: 635: " 



(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: 



DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION : /note- "oligonucleotide 22B" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685: 
TCAGACGCTC AGCACGTTTT TCACGACGGA AGGTACGAGC ACCAACAACG TAGTCCATGT 60 
TTACGATCGG ACGCAGACCG TCCGGTTTCG 
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(2) INFORMATION FOR SEQ ID NO: 686: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



Cix) FEATURE: 

CA) NAME /KEY : - 

(B) LOCATION: 1. .90 

(D) OTHER INFORMATION: /note- "oligonucleotide 22T" 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 686: 
CGTTTCATCC CGAAACCGGA CGGTCTGCGT CCGATCGTAA ACATGGACTA CGTTGTTGGT 
GCTCGTACCT TCCGTCGTGA AAAACGTGCT 



(2) INFORMATION FOR SEQ ID NO: 687: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

{ B } LOCATION: 1 . . 90 

.(D) OTHER INFORMATION: /note= "oligonucleotide 23B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 : 
CGTCCAGACC CAGAACGGAA GCACCCAGCA GACCCGGACG ACGAGCACGT TCGTAGTTCA 
GAACGGAGAA CAGAGCTTTA ACACGGGAGG 



(2) INFORMATION FOR SEQ ID NO: 688: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note= "oligonucleotide 23T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 683 : 
GAGCGTCTGA CCTCCCGTGT TAAAGCTCTG TTCTCCGTTC TGAACTACGA ACGTGCTCGT 
CGTCCGGGTC TGCTGGGTGC TTCCGTTCTG 



(2) INFORMATION FOR SEQ ID NO:689: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
■(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B ) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 24B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 
CGGTAACGTC AACTTTAACG AAGTACAGTT CCGC-CGGCGG GTCCTGAGCA CGAACACGCA 
GAACGAAGGT ACGCCAAGCA CGGTGGATGT 



INFORMATION FOR SEQ ID NO: 690: 

(i) . SEQUENCE CHARACTERISTICS:' 
(A) LENGTH: 90 base pairs 
' (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D). OTHER INFORMATION: /note* "oligonucleotide 24T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:690: 
- GGTCTGGACG ACATCCACCG TGCTTGGCGT ACCTTCGTTC TGCGTGTTCG TGCTCAGGAC 



CCGCCGCCGG AACTGTACTT CGTTAAAGTT 
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(2) INFORMATION FOR SEQ ID NO : 6 91: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 25B" 



(xi) SEQUENCE DESCRIPTION: SEQ-ID NO: 691 : 
CGTAACGACG AACGCAGTAG GTGTTCTGCG GTTTGATGAT GGAAGCGATA ACTTCGGTCA 
GACGGTCCTG CGGGATGGTG TCGTACGCGC 



(2) INFORMATION FOR SEQ ID NO: 692: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 
(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

;5) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 25T" 



(xi) SEQUENCE DESCRIPTION 1 : SEQ ID NO: 692: 
GACGTTACCG GCGCGTACGA CACCATCCCG CAGGACCGTC TGACCGAAGT TATCGCTTCC 
ATCATCAAAC CGCAGAACAC CTACTGCGTT 



(2) INFORMATION FOR SEQ ID NO: 693: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: .90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



"(ix) FEATURE : 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 26B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693: 
GACGCATGTA CGGCTGCAGG TCGGTCAGGG TGGAAACGTG GGATTTGAAT GCTTTACGAA 
CGTGACCGTG AGCAGCTTTC TGAACAACAG 



(2) INFORMATION FOR SEQ ID NO: 694: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix)' FEATURE : ~ 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 26T" 



(:<i) SEQUENCE DESCRIPTION: SEQ ID NO:694: 
CGTCGTTACG CTGTTGTTCA GAAAGCTGCT CACGGTCACG TTCGTAAAGC ATTCAAATCC 
C:\CGTTTCCA CCCTGACCGA CCTGCAGCCG 



INFORMATION FOR SEQ ID NO: 6 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(B) LOCATION: 1 . . 90 

. (D) OTHER INFORMATION: /note- "oligonucleotide 278" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 
GACCGGAGGA AGCTTCGTTC AGGGAGGAGG ACTGTTCGAT AACAACAGCG T.CACGCAGCG 
GGGAGGTTTC CTGCAGGTGA GCAACGAACT 



(2) INFORMATION FOR SEQ ID NO: 696: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear. 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1..90 

( 0) OTHER INFORMATION: /note- "oligonucleotide 27T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 : 
TACATGCGTC AGTTCGTTGC TCACCTGCAG GAAACCTCCC CGCTGCGTGA CGCTGTTGTT 
ATCGAACAGT CCTCCTCCCT GAACGAAGCT 



(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) -STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i:-:J FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 23B"' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697: ' 
AACCCTGCGG GATACCCTGG CACTGAACGT AGGATTTACC ACGGATACGA ACAGCGTGGT 
GGCACATGAA ACGCAGGAAA ACGTCGAACA 



(2) INFORMATION FOR SEQ ID NO : 698 : 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(ix) FEATURE : - 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 28T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 : 
TCCTCCGGTC TGTTCGACGT TTTCCTGCGT TTCATGTGCC ACCACGCTGT TCGTATCCGT 
GGTAAATCCT ACGTTCAGTG CCAGGGTATC 



(2) INFORMATION FOR SEQ ID NO: 699: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRA.NDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) FEATURE: 

(A) NAME/ KEY : - 

(3) LOCATION : 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 29B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699: 
GCAGCAGCAG ACCGTCACGA CGGATACCAG CGAACAGTTT GTTTTCCATG- TCACCGTAGC 
ACAGGGAGCA CAGCAGGGTG GACAGGATGG 



[2 ;■ .INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note« "oligonucleotide 29T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70O: 
CCGCAGGGTT CCATCCTGTC CACCCTGCTG TGCTCCCTGT GCTACGGTGA CATGGAAAA.C 
AAA.CTGTTCG CTGGTATCCG TCGTGACGGT 
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(2) INFORMATION FOR SEQ ID NO: 701: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 
' (B) LOCATION: 1 . . 90 

( D) OTHER INFORMATION: /note« "oligonucleotide 30B M 



(xi) SEQUENCE DESCRIPTION' SEQ ID NO:701: 
CGTATTCCGG AACACCACGA ACCAGGGTAC GCAGGAAGGT TTTAGCGTGG GTCAGGTGCG 
GAGTAACCAG CAGGAAGTCG TCAACCAGAC 



(2) INFORMATION FOR SEQ ID NO: 702: 

(i) SEQUENCE CHARACTERISTICS:. 
(A) LENGTH: 90 base pairs 
(3) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DMA 



(A) NAME /KEY: - 

[2; LOCATION : I . . 90 

( o ) OTHER INFORMATION: /note- "oligonucleotide 30T" 



(y.i) SEQUENCE DESCRIPTION: SEQ ID NO:702: 
CTGCTGCTGC GTCTGGTTGA CGACTTCCTG CTGGTTACTC CGCACCTGAC CCACGCTAAA 
ACCTTCCTGC GTACCCTGGT TCGTGGTGTT 



(2) TN FORMAT ION FOR SEQ ID NO: 70 3: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 90 base pairs 
(3)' TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 31B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: 
GAGCCGGCAT CTGAACGAAA GCGGTGCCAC CCAGAGCTTC GTCTTCAACC GGGAAGTTAA 
CAACGGTTTT ACGCAGGTTT ACAACGCAAC 



{2) INFORMATION FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 90 base pairs 

(B) TYPE :■ nucleic acid 

(C) STRANDEDNESS : single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(i;<) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 31T" 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO:704: 
CCGGAATACG GTTGCGTTGT AAACCTGCGT AAAACCGTTG TTAACTTCCC GGTTGAAGAC 
GAAGC7CTGG GTGGCACCGC TTTCGTTCAG 



'■'2} INFORMATION FOR SEQ ID MO: 705: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE:. DNA 



(ix) FEATURE : 

(A) NAME/ KEY : - 

(3) LOCATION: 1 . . 90 

( D ) OTHER INFORMATION: /note* "oligonucleotide 32B" 



(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 
GGATGGAGGT ACGAGCGTAG GAGGAGTAGT CGGACTGAAC TTCCAGGGTA CGGGTC3TCCA 



GCAGCAGACC GCACCACGGG AACAGACCGT 
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(2) INFORMATION FOR SEQ ID NO:706: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D)' OTHER INFORMATION: /note* "oligonucleotide 32T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:706: 
ATGCCGGCTC ACGGTCTGTT CCC.GTGGTGC GGTCTGCTGC T.GGACACCCG TACCCTGGAA 
GTTCAGTCCG ACTACTCCTC, CTACGCTCGT 

(2) INFORMATION FOR SEQ -ID NO: 707: 

(i) ■ SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

/note- "oligonucleotide 33B" 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:707: "* 
GGGAGTGGCA TTTCAGACGC AGAACACCGA ACAGTTTACG ACGCATGTTA CGACCAGCTT 
TGAAACCACG .GTTGAAGGTC AGGGAAGCAC 



(A) NAME /KEY : - 

(3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: 



(2) INFORMATION FOR SEQ ID NO: 703: 

(i) SEQUENCE CHARACTERISTICS: 
(A). LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY :' linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE : 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 33T ,f 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:708: 
ACCTCCATCC GTGCTTCCCT GACCTTCAAC CGTGGTTTCA AAGCTGGTCG TAACATGCGT 
CGTAAACTGT TCGGTGTTCT GCGTCTGAAA 



(2) INFORMATION FOR SEQ ID NO:709: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: -nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA 



(ix) FEATURE: ' 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(0) OTHER INFORMATION: /note= . "oligonucleotide 34B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID'NO:709: 
ACGCGTGGAA ACGGTAAGCC TGCAGCAGCA GGATTTTGTA GATGTTGGTG CAAACGGTCT 
GCAGGGAGTT TACCTGCAGG TCCAGGAACA 



INFORMATION FOR SEQ ID NO: 710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 34T" 



(xi) SEQUENCE DESCRIPTION: SEQ I D NO : 7 1 0 : 
TGCCACTCCC TGTTCCTGGA CCTGCAGGTA AACTCCCTGC AGACCGTTTG CACCAACATC 
TACAAAATCC TGCTGCTGCA GGCTTACCGT 
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(2) INFORMATION FOR SEQ ID NO: 711: 

(ij SEQUENCE CHARACTERISTICS : 
•(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note- "oligonucleotide 35B M 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:711: 
AGTAGCACAG GGAAGCGGTG TCGGAGATAA CACGCAGGAA GAAGGTCGGG TTTTTCCAAA 60 
CCTGCTGGTG GAACGGCAGC TGCAGAACGC 90 



(2) INFORMATION FOR SEQ ID NO: 7 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 90 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) .MOLECULE TYPE: DMA 



-(ix) FEATURE: 



{A} NAME /KEY: - 

( 3) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note 



= "oligonucleotide 35T 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 2 : 



TTCCACGCGT GCGTTCTGCA GCTGCCGTTC CACCAGCAGG TTTGGAAAAA CCCGACCTTC 



60 



TTCCTGCGTG TTATCTCCGA CACCGCTTCC 
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(2) INFORMATION FOR SEQ ID NO: 713: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 



<ix) FEATURE:- ' ' 

(A) NAME /KEY: - 

(B) LOCATION: 1 . . 90 

(D) OTHER INFORMATION: /note* "oligonucleotide 36B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:713: 
GGCACAGCCA CTGAACAGCT TCGGACGGCA GCGGACCAGC AGCACCTTTA GCACCCAGGG 
ACATACCAGC GTTTTTAGCT TTCAGGATGG 



(2) INFORMATION FOR SEQ ID N0:714: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 90 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(3) LOCATION: 1 . . 90 

( D) OTHER INFORMATION: /note* "oligonucleotide 36T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID.NO:714: 
CTGTGCTACT CCA.TCCTGAA AGCTAAAAAC GCTGGTATGT CCCTGGGTGC TAAAGGTGCT 
GCTGGTCCGC TGCCGTCCGA AGCTGTTCAG 



IN FORMAT I OM FOR SEQ ID NO: 715: 

(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 87 base pairs 
(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE; DNA 



(ix) FEATURE : . 

(A) NAME/ KEY: - 

(B) LOCATION: 1 . . 87 

■(D) OTHER INFORMATION: /note= "oligonucleotide 37B" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:715: 
ACAGCTGGGT CTGAGCGGTA CGCAGGGAAC CCAGCAGCGG AACG.TAGGTA ACACGGTGAC 



GGGTCAGTTT CAGCAGGAAA GCCTGGT 



(2) INFORMATION FOR SEQ ID NO: 716: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
CD} TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

( B ) LOCATION: 1. .87 

(D) OTHER INFORMATION: /note- "oligonucleotide 37T M 



{xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 1 6 : 
TGGCTGTGCC ACCAGGCTTT CCTGCTGAAA CTGACCCGTC ACCGTGTTAC CTACGTTCCG 
CTGCTGGGTT. CCCTGCGTAC CGCTCAG 



(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : - 

{ 3) LOCATION: 1 . . 63 

(D) OTHER INFORMATION: /note= "oligonucleotide 33B M 



(xi) SEQUENCE DESCRIPTION: SZQ ID NO: 717: 
ACGGCAGAGC CGGGTTAGCA GCAGCTTCCA GAGCGGTCAG GGTGGTACCC GGCAGTTTAC 



(2) INFORMATION FOR SEQ ID NO:718: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS-: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 64 

(D) OTHER INFORMATION: /note- "oligonucleotide 38T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:718: 
ACCCAGCTGT CCCGTAAACT GCCGGGTACC ACCCTGACCG CTCTGGAAGC TGCTGCTAAC 
CCGG 



(2) INFORMATION FOR SEQ ID NO : 719: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/ KEY: - ■ 

(B) LOCATION: 1 . . 50 

(D) OTHER INFORMATION: /note- "oligonucleotide 39B." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:719: 
GCGTGCCTCG AGGAATTCGG ATCCATTAGT CCAGGATGGT TTTGAAGTCG 



(2} INFORMATION - FOR SEQ ID MO: 720: 

(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

"(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY: - 

(B) LOCATION: 1..60" 

(D) OTHER INFORMATION: /note- "oligonucleotide 39T" 



(xi) SEQUENCE DESCRIPTION: SEQ ID MO:720: 
CTCTGCCGTC CGACTTCAAA ACCATCCTGG ACTAATGGAT CCGAATTCCT CGAGGCACGC 
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(2) INFORMATION FOR SEQ ID NO: 721: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME /KEY : - 

<B) LOCATION: 1 . .3451 

(D) OTHER INFORMATION: /note= "hTRT sequence employing codon 

distribution preferentially used by 
highly expressed genes in E. coli 
containing SacI and Xhol sites" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:721: 

GCACGCGGGA GCTCTAGAGT CGACCATATG .CCGCGTGCTC CGCGTTGCCG TGCTGTTCGT 60 

TCCCTGCTGC GTTCCCACTA TCGCGAAGTT CTGCCGCTGG CTACCTTCGT TCGTCGTCTG 120 

GGCCCGCAGG GTTGGCGTCT GGTTCAGCGT GGTGACCCGG CTGCTTTCCG TGCTCTGGTT 130 

GCTCAGTGCC TGGTTTGCGT TCCGTGGGAC GCTCGTCCGC CGCCGGCTGC TCCGTCCTTC 24 0 

CGTCAGGTTT - CCTGCCTGAA AGAACTGGTT GCTCGTGTTC TGCAGCGTCT GTGCGA^CGT 300 

GGTGCTAAAA ACGTTCTGGC TTTCGGTTTC GCTCTGCTGG ACGGTGCTCG TGGTGGTCCG 3 60 

CCGGAA GCAT TCACCACCTC CGTTCGTTCC TACCTGCCGA ACACCGTTAC CGACGCTCTG. 4 20. 

GG7GG7TCCG GTGCTTGGGG TCTGCTGCTG CGTCG7GTTG GTGACGACGT TCTGGTTCAC 4 8-0' 

CTGCTGGCTC GTTGCGCTCT GTTCGTTCTG GTTGC73CGT CCTGCGCTTA CCAGGTTTGT 54 0 

GGTCCGCCGC TGTACCAGCT GGGTGCTGCT ACCCAGGCTC GTCCGCCGCC GCACGCTTCC 600 

GGTCCGCGTC GTCGTCTGGG TTGCGAACGT GCTTGGAACC ACTCCGTTCG TGAAGCTGGT 660 

GTTCCGCTGG GTCTGCCGGC . TCCGGGTGCT CGTCG7CGTG GTGGTTCCGC TTCCCGTTCC 720 

CTGCCGCTGC CGAAACGTCC GCGTCGTGGT GCTGC77CGG AACCGGAACG TACCCCGGTT 7 80 

GGTCAGGGTT CCTGGGCTCA . CCCGGGTCGT ACCCG7GGTC CGTCCGACCG TGGTTTCTGC 8 40 

GTTGTTTCCC CGGCTCGTCC GGCTGAAGAA GCTACCTCCC TGGAAGGTGC TCTGTCCGGC 900 

ACCCGTCACT CCCACCCGTC CGTTGGTCGT CAGCACCACG CTGGTCCGCC GTCCACCTCC 960 

CGTCCGCCGC GTCCGTGGGA CACCCCGTGC CCGCCGGTTT ACGCT.GAAAC CAAACACTTC 1020 

CTGTACTCCT CCGGTGACAA AGAACAGCTG CGTCCG7CCT TCCTGCTGTC CTCCCTGCGT 108 0 

CCGTCCCTGA CCGGTGCTCG TCGTCTGGTT GAAAC7ATCT TCCTGGGTTC CCGTCCGTGG 114 0 

ATGCCGGGCA CCCCGCGTCG TCTGCCGCGT CTGCCGCAGC GTTACTGGCA GATGCGT.CCG 12 00 
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CTGTTCCTGG 


AACTGCTGGG 


CAACCACGGT 


CAGTGCCCGT 


ATGGTGTTCT 


GCTGAAAACC 


1260 


CACTGCCCGC 


TGCGTGCTGC 


TGTTACCCCG 


GCTGCTGGTG 


TTTGCGCTCG 


TGAAAAACCG 


1320 


CAGGGTTCCG 


TTGCTGCTCC 


GGAAGAAGAA 


GATACCGACC 


CGCGTCGTCT 


GGTTCAGCTG 


1380 


CTGCGTCAGC 


ACTCCTCCCC 


GTGGCAGGTT 


TACGGTTTCG 


TTCGTGCATG 


CCTGCGTCGT 


14 40 


CTGGTTCCGC 


CGGGTCTGTG 


GGGTTCCCGT 


CACAACGAAC 


GTCGTTTCCT 


GCGTAACACC 


1500 


AAAAAATTCA 


TCTCCCTGGG 


TAAACACGCT 


AAACTGTCCC 


TGCAGGAACT 


GACCTGGAAA 


1560 


ATGTCCGTTC 


GTGACTGCGC 


TTGGCTGCGT 


CGTTCTCCGG 


GTGTTGGTTG 


CGTTCCGGCT 


1620 


GCTGAACACC 


GTCTGCGTGA 


AGAA*ATCCTG 


GCTAAATTCC 


TGCACTGGCT 


GATGTCCGTA 


1680 


TACGTTGTTG 


AACTGCTGCG 


TTCCTTCTTC 


TACGTTACCG 


AAACCACCTT 


CCAGAAAAAC 


17 4 0 


CGTCTGTTCT 


TGTACCGTAA 


ATCCGTTTGG 


TCCAAACTGC 


AGTCCATCGG 


TATCCGTCAG 


1800 


CA'CCTGAAAC 


GTGTTCAGCT 


GCGTGAACTG 


TCCGAAGCTG 


AAGTTCGTCA. 


GCACCGTGAA 


1860 


GCTCGTCCGG 


CTCTGCTGAC 


CTCCCGTCTG 


CGTTTCATCC 


CGAAACCGGA 


CGGTCTGCGT 


1920 


CCGATCGTAA 


ACATGGACTA 


CGTTGTTGGT 


GCTCGTACCT 


TCCGTCGTGA 


AAAACGTGCT 


1980 


GAGCGTCTGA 


CCTCCCGTGT 


TAAAGCTCTG 


TTCTCCGTTC 


TGAACTACGA 


ACGTGCTCGT 


2040 


CGTCCGGGTC 


TGCTGGGTGC 


TTCCGTTCTG 


GGTCTGGACG 


ACATCCACCG 


TGCTTGGCGT 


2100 


AGCTTGGTTC 


TGCGTGTTCG 


TGCTCAGGAC 


CCGCCGCCGG 


AACTGTACTT 


CGTTAAAGTT 


2160 


GACGTTACCG 


GCGCGTACGA 


CACCATCCCG 


CAGGACCGTC 


TGACCGAAGT 


TATCGCTTCC 


2220 


A7CATCAAAC 


CGCAGAACAC 


CTACTGCGTT 


CGTCGTTACG 


CTGTTGTTCA 


GAAAGCTGCT 


2280 


CACGGTC.ACG 


T7CGTAAAGC 


ATTCAAATCC 


CACGTTTCCA 


CCCTGACCGA 


CCTGCAGCCG 


23 4 0 


TACATGCGTC 


AGTTCGTTGC 


TCACCTGCAG 


GAAACCTCCC 


CGCTGCGTGA 


CGCTGTTGTT ' 


2400 


ATCGAACAGT 


CCTCCTCCCT 


GAACGAAGCT 


TCCTCCGGTC 


TGTTCGACGT 


TTTCCTGCGT 


24 60 


TTCATGTGCG 


ACCACGCTGT 


TCGTATCCGT 


GGTAAATCCT 


ACGTTCAGTG 


CCAGGGTATC 


2520 


GCGCAGGGTT 


CCATCCTGTC 


CACCCTGCTG 


TGCTCCCTGT 


GCTACGGTGA 


CATGGAAAAC 


2580 


AAA.CTGTTCG 


CTGGTATCCG 


TCGTGACGGT 


CTGCTGCTGC 


GTCTGGTTGA 


CGACTTCCTG- 


2640 


CTGGTTACTC 


CGCACCTGAC 


CCACGCTAAA 


ACCTTCCTGC 


GTACCCTGGT 


TCGTGGTGTT 


27 00 


CCGGAATACG 


GTTGCGTTGT 


AAACCTGCGT 


AAAACCGTTG 


TTAACTTCCC 


GGTTGAAGAC 


2 7 60 


GAAGCTCTGG 


GTGGCACCGC 


TTTCGTTCAG 


ATGCCGGCTC 


ACGGTCTGTT 


CCCGTGGTGC 


2820 


GGTCTGCTGC 


TGGACACCCG 


TACCCTGGAA 


GTTCAGTCCG 


ACTACTCCTC 


CTACGCTCGT 


2830 


ACCTCCATCC 


GTGCTTCCCT 


GACCTTCAAC 


CGTGGTTTCA 


. AAGCTGGTCG 


TAACATGCGT ■ 


2940 


CGTAAACTGT 


TCGGTGTTCT 


GCGTCTGAAA 


TGCCACTCCC 


TGTTCCTGGA 


CCTGCAGGTA 


3000 


AACTCCCTGC 


AGACCGTTTG 


CACCAACATC 


TACAAAATCC 


TGCTGCTGCA 


GGCTTACCGT 


3060 



416 



TTCCACGCGT 


GCGTTCTGCA 


GCTGCCGTTC 


CACCAGCAGG 


TTTGGAAAAA 


LLLbALLI 11. • 


3120 


TTCCTGCGTG 


TTATCTCCGA 


CACCGCTTCC 


CTGTGCTACT 


CCATCCTGAA 




3180 


GCTGGTATGT 


CCCTGGGTGC 


TAAAGGTGCT 


GCTCjb ILrOtjf- 




nuu loll v^rio 


3240 


TGGCTGTGCC 


ACCAGGCTTT 


CCTGCTGAAA 


CTGACCCGTC 


ACCGTGTTAC 


CTACGTTCCG 


3300 


CTGCTGGGTT 


CCCTGCGTAC 


CGCTCAGACC 


CAGCTGTCCC 


GTAAACTGCC 


GGGTACCACC 


3360 


CTGACCGCTC 


TGGAAGCTGC 


TGCTAACCCG 


GCTCTGCCGT 


CCGACTTCAA 


AACCATCCTG 


3420 


GACTAATGGA 


TCCGAATTCC 


TCGAGGCACG 


C 






3451 



(2) INFORMATION FOR SEQ ID NO: 722: 

( i ) SEQUENCE ^CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
(3} TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY - 

(3) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note« "antisense oligonucleotide 

corresponding to positions 
31-60 of hTRT" ' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722: 
GGCATCGCGG GGGTGGCCGG GGCCAGGGCT 



" (2) INFORMATION FOR SEQ ID NO: 7 23: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 30 base pairs 
(3) TYPE: nucleic acid 

(C) STRA.NDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

( A) NAME/KEY: - 

(B) LOCATION: 1 . . 30 

(D) OTHER INFORMATION: /note= "antisense oligonucleotide 

corresponding to positions 
496-525 of hTRT" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: 
GCGCAGCGTG CCAGCAGGTG AACCAGCACG 



30 



417 



(2) INFORMATION FOR SEQ ID NO: 724: 

(i) S-EQUENCE CHARACTERISTICS: 
(A) LENGTH: 30 base pairs 
. (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: - 

(B) . LOCATION: 1. . 30 

(D) OTHER INFORMATION: /note* "antisense oligonucleotide 

corresponding to positions 
631-660 of hT RT " 



(xi) SEQUENCE DESCRI PT-ION : SEQ ID NO:724: 
GCCCGTTCGC ATCCCAGACG CCTTCGGGGT 



(2) INFORMATION FOR SEQ ID NO: 725.: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
■(C) STRANDEDNESS: .single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 



(ix) "FEATURE: ■ 

(A) NAME/ KEY : - 

(B) LOCATION: 1.".30 

(D) OTHER INFORMATION: /note= "antisense oligonucleotide 

corresponding to positions 
646-675 of hTRT" 



(xi) SEQUENCE DESCRIPTION:. SEQ ID NO: 725: 
ACGCTATGGT TCCAGGCCCG TTCGCATCCC 



(2) INFORMATION FOR SEQ ID NO:726: 

(i) SEQUENCE. CHARACTERISTICS : 

(A) LENGTH: 124 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : DMA 



418 



(ix) FEATURE : 

(A) NAME/KEY: - 

(B) LOCATION : 1 . . 124 

(D) OTHER INFORMATION : /note- "sequence present in pGRN176 but 

not pGRN175" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NQ:726: 

GTGGCGGAGG GACTGGGGAC CCGGGCACCG GTCCTGCCCC TTCACCTTCC AGCTCCGCCT 60 

CGTCCGCGCG GA.ACCCCGCC CCGTCCCGAA CCCTTCCCGG GTCCCCGGCC CAGCCCCTTC 12 0 

CGGG 12 4 

(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1040 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

tii) MOLECULE TYPE: peptide 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied- s i t e 
(3) LOCATION: 1 . . 300 

(D) OTHER. INFORMATION: /product- "OTHER" 

/ note = "Xaa at positions 1-300 may be 
present or absent" 

(ix) FEATURE: 

(A) NAME /KEY : Modi f ied- s i te . 
(3) LOCATION ; 630 * 
{ D) OTHER INFORMATION: /product* "OTHER" ■ 

/note* M Xaa at position 630 may be 

present or absent" 

{ix} FEATURE: 

(A) NAME/KEY: Modi f ied-si te 
(3) LOCATION: 649. . 663 

(D) OTHER INFORMATION: /product* "OTHER" 

/note* M Xaa at positions 649-663 may be 
present or absent" 

(ix) FEATURE: 

(A) NAlME/KEY: Modi f ied-si te 
(3) LOCATION: 674 . . 688 

{D} OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa at positions 674-688 may be 
present or- absent" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s i t e ■ 
(3) LOCATION: 701. . 706 

{ D) OTHER INFORMATION: /product* "OTHER" 

/note* "Xaa at positions 701-706 may be 
present or absent" 



4 19 



Ux) FEATURE: 

(A) NAME/KEY: Modif ied-site 

(B) LOCATION: 771. .790 

( D ) OTHER INFORMATION: /product- "OTHER" 

/note= "Xaa at positions 771-790 may be 
present or absent" 

(ix) FEATURE: 

(A) NAME/KEY: Modi f ied-s ite 

(B) LOCATION: 887 . . 937 

{DJ OTHER INFORMATION: /product= "OTHER" 

/note= "Xaa at positions 887-937 may be 
present or absent" 

(ix) FEATURE: 

(A) NAME/ KEY : Modif ied-site 

(B) LOCATION: 965,. 994 

'(D) OTHER INFORMATION: /product^ "OTHER" 

/note= "Xaa at positions 965-994 may be 
present or absent" 

(ix) FEATURE: 

(A) NAME/ KEY : Modi f ied-s i te 

(B) LOCATION: 1018.. 1027 

(D) OTHER INFORMATION:, /product— "OTHER" 

/note=* "Xaa at positions 1018-1027 may 
be present or absent" 



(xi) 'SEQUENCE DESCRIPTION: SEQ ID NO: 727: 

Xaa . Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

1 5 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

■ 20 '25" 30 . 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

35 ' " 40 45 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

5 0 - 55 60 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

65 ' 70 * 75 80 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

85 90 95 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

100 105 110 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

115 120 125 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 

130 135 - 140 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
145 150 155 160 



420 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
165 I 70 175 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
180 185 190 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
195 200 205 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
210 215 220 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
225 230 235 . 240 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
245 250 255 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
260 265 270 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa' Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
" 275 280 285 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
290 295 300 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
305 310 315 320 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
325 330 33= 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
340 3_4 5 3 = 0 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa .Xaa Xaa Xaa Xaa' Xaa Xaa Xaa 
355 360 365. 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
370 375 380 



Xaa 
385 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 



390 395 



Xaa Xaa Xaa .Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
405 4 -l° 415 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa. Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
420 425 "30 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
435 440 445 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
450 - 455 160 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
465 4^0 



475 480 



421 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
485 490 . 495 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
500 505 510 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
515 520 525 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
530 535 540 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa. Xaa Xaa Xaa Xaa 
545 550 555 560 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
565 570 575 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
580 585 590 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
595 600 ' 605 

Xaa Xaa Xaa Xaa Xaa Phe Phe Tyr Xaa Thr Glu Xaa Xaa- Xaa Xaa Xaa 
610 615 620 

Xaa Xaa Xaa Xaa Xaa Xaa Arg Xaa Xaa Xaa Trp Xaa Xaa Xaa Xaa Xaa 
625 630 635 640 

Xaa Xaa lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
645 650 655 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Glu Xaa Xaa Val Xaa Xaa Xaa Xaa Xaa 
660 665 670 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
675 630 635 

Xaa Xaa Xaa Arg Xaa Xaa Pro Lys Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
690 695 700 

Xaa Xaa Arg Xaa lie Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
705 710 -* 7 15 720 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
725 730 735 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
740 745 750 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
755 760 ■ 765 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
770 775 780 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Phe Xaa Xaa Xaa Asp Xaa 
785 790 795 800 



422 



Xaa Xaa Xaa Tyr Asp Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
805 810 815 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
820 825 830 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
835 840 845 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
850 855 860 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
865 870 875 880 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
885 8 90. 895 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
900 905 910 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
915 920 925 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Tyr Xaa Xaa Xaa Xaa Gly Xaa 
930 _ 935 940 

Xaa G 1 n Gly Xaa Xaa Xaa Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
945 950 955 960 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
965 970 975 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
930 935 4 990 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Asp Asp Xaa Leu Xaa Xaa Xaa Xaa 
995 1000 1005 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
1010 1015 1020 



Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Lys 
1025 1030 1035 1040 



